SEQUENCE LISTING 



Mb 



<110> Wang, Danher 

<12 0> GENETIC VACCINE AGAINST HUMAN IMMUNODEFICIENCY VIRUS 

<130> 22488-712 

<150> 09/585,599 

<151> 2000-06-02 

<150> PCT/US01/18238 

<151> 2001-06-04 

<160> 75 

<170> Patentln version 3.1 

<210> 1 

<211> 7 

<212> DNA 

<213> Ebola virus 

<400> 1 

ttttttt 



^! <210> 2 

W <211> 7 



<212> RNA 

<213> Ebola virus 

<400> 2 
uuuuuuu 



<210> 3 

<211> 8 

<212> RNA 

<213> Artificial sequence 
<220> 

<223> Modified RNA editing site 

<400> 3 
uucuucuu 



<210> 4 

<211> 7 

<212> DNA 

<213> Ebola virus 

<400> 4 
aaaaaaa 



<210> 5 

<211> 8 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> DNA of modified RNA editing site 

<400> 5 
aagaagaa 



<210> 6 

<211> 6 

<212> RNA 

<213> Ebola virus 

<400> 6 
uuuuuu 



<210> 7 

<211> 6 

<212> RNA 

<213> Artificial sequence 
<220> 

<223> Modified RNA editing site 

<400> 7 
uucuuc 



<210> 8 

<211> 6 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> DNA of modified RNA editing site 

<400> 8 
ttcttc 



<210> 9 

<211> 21 

<212> PRT 

<213> Homo sapiens 

<400> 9 

Ser Trp Leu Leu Leu Leu Leu Leu Ser Leu Ser Leu Leu Gin Ala Thr 
15 10 15 



Asp Phe Met Ser Leu 
20 



<210> 10 

<211> 5 

<212> PRT 

<213> Ebola virus 



<400> 10 



Arg Arg Thr Arg Arg 
1 ^ 5 



<210> 11 

<211> 4 

<212> PRT 

<213> Human immunodeficiency virus type 1 

<400> 11 

Arg Glu Lys Arg 
1 



<210> 


12 




<211> 


9 




<212> 


DNA 




<213> 


Artificial 


sequence 


<220> 






<223> 


DNA of GAG 


site 


<400> 


12 




ggagctggt 





<210> 13 

<211> 3 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> GAG site 

<400> 13 

Gly Ala Gly 
1 



<210> 14 
<211> 3157 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Env/Tat/Rev 
<400> 14 

gaattctgca acaactgctg tttatccatt ttcagaattg ggtgtcgaca tagcagaata 60 
ggcgttactc gacagaggag agcaagaaat ggagccagta gatcctagac tagagccctg 12 0 
gaagcatcca ggaagtcagc ctaaaactgc ttgtaccaat tgctattgta aaaagtgttg 180 
ctttcattgc caagtttgtt tcataacaaa agccttaggc atctcctatg gcaggaagaa 24 0 
gcggagacag cgacgaagac ctcctcaagg cagtcagact catcaagttt ctctatcaaa 3 00 



w 



gcagtaagta 


gtacatgtaa 


tgcaacctat 


acaaatagca 


atagtagcat 


tagtagtagc 


360 


aataataata 


gcaatagttg 


tgtggtccat 


agtaatcata 


gaatatagga 


aaatattaag 


420 


acaaagaaaa 


atagacaggt 


taattgatag 


actaatagaa 


agagcagaag 


acagtggcaa 


480 


tgagagtgaa 


ggagaaatat 


cagcacttgt 


ggagatgggg 


gtggagatgg 


ggcaccatgc 


540 


tccttgggat 


gttgatgatc 


tgtagtgcta 


cagaaaaatt 


gtgggtcaca 


gtctattatg 


600 


gggtacctgt 


gtggaaggaa 


gcaaccacca 


ctctattttg 


tgcatcagat 


gctaaagcat 


660 


atgatacaga 


ggtacataat 


gtttgggcca 


cacatgcctg 


tgtacccaca 


gaccccaacc 


720 


cacaagaagt 


agtattggta 


aatgtgacag 


aaaattttaa 


catgtggaaa 


aatgacatgg 


780 


tagaacagat 


gcatgaggat 


ataatcagtt 


tatgggatca 


aagcctaaag 


ccatgtgtaa 


840 


aattaacccc 


actctgtgtt 


agtttaaagt 


gcactgattt 


gaagaatgat 


actaatacca 


900 


atagtagtag 


cgggagaatg 


ataatggaga 


aaggagagat 


aaaaaactgc 


tctttcaata 


960 


tcagcacaag 


cataagaggt 


aaggtgcaga 


aagaatatgc 


atttttttat 


aaacttgata 


1020 


taataccaat 


agataatgat 


actaccagct 


atacgttgac 


aagttgtaac 


acctcagtca 


1080 


ttacacaggc 


ctgtccaaag 


gtatcctttg 


agccaattcc 


catacattat 


tgtgccccgg 


1140 


ctggttttgc 


gattctaaaa 


tgtaataata 


agacgttcaa 


tggaacagga 


ccatgtacaa 


1200 


atgtcagcac 


agtacaatgt 


acacatggaa 


ttaggccagt 


agtatcaact 


caactgctgt 


1260 


taaatggcag 


tctggcagaa 


gaagaggtag 


taattagatc 


tgccaatttc 


acagacaatg 


1320 


ctaaaaccat 


aatagtacag 


ctgaaccaat 


ctgtagaaat 


taattgtaca 


agacccaaca 


1380 


acaatacaag 


aaaaagtatc 


cgtatccaga 


gaggaccagg 


gagagcattt 


gttacaatag 


1440 


gaaaaatagg 


aaatatgaga 


caagcacatt 


gtaacattag 


tagagcaaaa 


tggaataaca 


1500 


ctttaaaaca 


gatagatagc 


aaattaagag 


aacaatttgg 


aaataataaa 


acaataatct 


1560 


ttaagcagtc 


ctcaggaggg 


gacccagaaa 


ttgtaacgca 


cagttttaat 


tgtggagggg 


1620 


aatttttcta 


ctgtaattca 


acacaactgt 


ttaatagtac 


ttggtttaat 


agtacttgga 


1680 


gtactaaagg 


gtcaaataac 


actgaaggaa 


gtgacacaat 


caccctccca 


tgcagaataa 


1740 


aacaaattat 


aaacatgtgg 


caggaagtag 


gaaaagcaat 


gtatgcccct 


cccatcagtg 


1800 


gacaaattag 


atgttcatca 


aatattacag 


ggctgctatt 


aacaagagat 


ggtggtaata 


1860 


gcaacaatga 


gtccgagatc 


ttcagacctg 


gaggaggaga 


tatgagggac 


aattggagaa 


1920 


gtgaattata 


taaatataaa 


gtagtaaaaa 


ttgaaccatt 


aggagtagca 


cccaccaagg 


1980 


caaagagaag 


agtggtgcag 


agagaaaaaa 


gagcagtggg 


aataggagct 


ttgttccttg 


2040 


ggttcttggg 


agcagcagga 


agcactatgg 


gcgcagcgtc 


aatgacgctg 


acggtacagg 


2100 



M 



ccagacaatt 


attgtctggt 


atagtgcagc 


agcagaacaa tttgctgagg gctattgagg 


2160 


cgcaacagca 


tctgttgcaa 


ctcacagtct 


ggggcatcaa gcagctccag gcaagaatcc 


2220 


tqqctqtqqa 


aagataccta 


aaggatcaac 


agctcctggg gatttggggt 


tgctctggaa 


2280 


aactcatttg 


caccactgct 


gtgccttgga 


atgctagttg gagtaataaa 


tctctggaac 


2340 


agatt tggaa 


taacatgacc 


tcTcratcrcracrt 


gggacagaga aattaacaat 


tacacaagct 


2400 


taatacactc 


cttaattgaa 


gaatcgcaaa 


accagcaaga aaagaatgaa 


caagaattat 


2460 


tggaattaga 


taaatgggca 


aatttQtacra 


attggtttaa cataacaaat tggctgtggt 


2520 


atataaaatt 


attcataatg 


ataqtaaaaQ 

*~ ZJ Zj ZJ 


gcttggtagg tttaagaata gtttttgctg 


2580 


tactttctgt 


agtgaataga 


qttaqqcaqq 

ZJ ^ ZJZJ^ ZJZ) 


gatattcacc attatcgttt 


cagacccacc 


2640 


tcccaatccc 


gaggggaccc 


gacaggcccg 


aaggaataga agaagaaggt 


ggagagagag 


2700 


acagagacag 


atccattcga 


ttagtgaacg 


gatccttagc acttatctgg 


gacgatctgc 


2760 




cctct tcacrc 


fcaccacccrct 

W V* W W W4 V*» W W W 


tgagagactt actcttgatt 


gtaacgagga 


2820 


ttotcrcraact 


t c t crcjcra c crc 


aqqqqqtqqq 


aagccctcaa atattggtgg 


aatctcctac 


2880 


agtattggag 


tcaggagcta 


aagaatagtg 


ctgttagctt gctcaatgcc 


acagctatag 


2940 


cagtagctga 


ggggacagat 


agggttatag 


aagtagtaca aggagcttat 


agagctattc 


3000 


gccacatacc 


tagaagaata 


agacagggct 


tggaaaggat tttgctataa 


gatgggtggc 


3060 


aagtggtcaa 


aaagtagtgt 


ggttggatgg 


cctgctgtaa gggaaagaat 


gagacgagct 


3120 


gagccagcag 


cagatggggt 


gggagcagca 


tctcgag 




3157 


<210> 15 
<211> 508 
<212> DNA 

<213> Artificial sequence 








<220> 

<223> Modified IL-2 










<400> 15 
tcactctctt 


taatcactac 


tcacagtaac 


ctcaactcct gccacaatgt 


acaggatgca 


60 


actcctgtct 


tgcattgcac 


taagtcttgc 


acttgtcaca aacagtgcac 


ctacttcaag 


120 


ttctacaaag 


aaaacacagc 


tacaactgga 


gcatttactg ctggatttac 


agatgatttt 


180 


gaatggaatt 


aataattaca 


agaatcccaa 


actcaccagg atgctcacat 


ttaagtttta 


240 


catgcccaag 


aaggccacag 


aactgaaaca 


tcttcagtgt cttgaagaag 


aactcaaacc 


300 


tctggaggaa 


gtgctaaatt 


tagctcaaag 


caaaaacttt cacttaagac 


ccagggactt 


360 


aatcagcaat 


atcaacgtaa 


tagttctgga 


actaaaggga tctgaaacaa 


cattcatgtg 


420 


tgaatatgct 


gatgagacag 


caaccattgt 


agaatttctg aacagatgga 


ttaccttttg 


480 



tcaaagcatc atctcaacac taacttga 



508 



<210> 16 
<211> 2280 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Modified Env 
<400> 16 

gaattcgcca ccatgggagt gaaggagaaa tatcagcact tgtggagatg ggggtggaga 60 

tggggcacca tgctccttgg gatgttgatg atctgtagtg ctacagaaaa attgtgggtc 12 0 

acagtctatt atggggtacc tgtgtggaag gaagcaacca ccactctatt ttgtgcatca 180 

gatgctaaag catatgatac agaggtacat aatgtttggg ccacacatgc ctgtgtaccc 24 0 

acagacccca acccacaaga agtagtattg gtaaatgtga cagaaaattt taacatgtgg 300 

aaaaatgaca tggtagaaca gatgcatgag gatataatca gtttatggga tcaaagccta 360 

aagccatgtg taaaattaac cccactctgt gttagtttaa agtgcactga tttgaagaat 420 

gatactaata ccaatagtag tagcgggaga atgataatgg agaaaggaga gataaaaaac 480 

tgctctttca atatcagcac aagcataaga ggtaaggtgc agaaagaata tgcatttttt 540 

tataaacttg atataatacc aatagataat gatactacca gctatacgtt gacaagttgt 600 

aacacctcag tcattacaca ggcctgtcca aaggtatcct ttgagccaat tcccatacat 660 

tattgtgccc cggctggttt tgcgattcta aaatgtaata ataagacgtt caatggaaca 720 

ggaccatgta caaatgtcag cacagtacaa tgtacacatg gaattaggcc agtagtatca 780 

actcaactgc tgttaaatgg cagtctggca gaagaagagg tagtaattag atctgccaat 84 0 

ttcacagaca atgctaaaac cataatagta cagctgaacc aatctgtaga aattaattgt 900 

acaagaccca acaacaatac aagaaaaagt atccgtatcc agagaggacc agggagagca 960 

tttgttacaa taggaaaaat aggaaatatg agacaagcac attgtaacat tagtagagca 1020 

aaatggaata acactttaaa acagatagat agcaaattaa gagaacaatt tggaaataat 1080 

aaaacaataa tctttaagca gtcctcagga ggggacccag aaattgtaac gcacagtttt 1140 

aattgtggag gggaattttt ctactgtaat tcaacacaac tgtttaatag tacttggttt 1200 

aatagtactt ggagtactaa agggtcaaat aacactgaag gaagtgacac aatcaccctc 1260 

ccatgcagaa taaaacaaat tataaacatg tggcaggaag taggaaaagc aatgtatgcc 132 0 

cctcccatca gtggacaaat tagatgttca tcaaatatta cagggctgct attaacaaga 1380 

gatggtggta atagcaacaa tgagtccgag atcttcagac ctggaggagg agatatgagg 144 0 



gacaattgga 


gaagtgaatt 


atataaatat 


aaagtagtaa 


aaattgaacc 


attaggagta 


1500 


gcacccacca 


aggcaaagag 


aagagtggtg 


cagactagtg 


cagtgggaat 


aggagctttg 


1560 


ttccttgggt 


tcttgggagc 


agcaggaagc 


actatgggcq 


cagcgtcaat 

ZJ ZJ 


gacgctgacg 


1620 


gtacaggcca 


gacaattatt 


gtctggtata 


gtgcagcagc 


agaacaattt 


gctgagggct 


1680 


attgaggcgc 


aacagcatct 


gttgcaactc 


acagtctggg 


gcatcaagca 


gctccaggca 


1740 


agaatcctgg 


ctgtggaaag 


atacctaaag 


gatcaacagc 


tcctggggat 


ttggggttgc 


1800 


tctggaaaac 


tcatttgcac 


cactgctgtg 


ccttggaatg 


ctagttggag 


taataaatct 


1860 


ctggaacaga 


tttggaataa 


catgacctgg 


atggagtggg 

ZJ Z, 1 ZJ ZJ ZJ ZJ 


acagagaaat 


taacaattac 


1920 


acaagcttaa 


tacactcctt 


aattgaagaa 


tcgcaaaacc 


agcaagaaaa 


gaatgaacaa 


1980 


gaattattgg 


aattagataa 


atgggcaagt 


ttgtggaatt 


ggtttaacat 


aacaaattgg 


2040 


ctgtggtata 


taaaattatt 


cataatgata 


gtaggaggct 


tggtaggttt 


aagaatagtt 


2100 


tttgctgtac 


tttctgtagt 


gaatagagtt 


aggcagggat 


attcaccatt 


atcgtttcag 


2160 


acccacctcc 


caatcccgag 


gggacccgac 


aggcccgaag 


gaatagaaga 


agaaggtgga 


2220 


gagagagaca 


gagacagatc 


cattcgatta 


gtgaacggat 


ccttagcact 


tatctggtaa 


2280 


<210> 17 
<211> 1496 
<212> DNA 

<213> Artificial sequence 










<220> 

<223> Full length Gag 










<400> 17 
ggctagaagg 


agagaggatg 


ggtgcgagag 


cgtcagtatt 


aagcggggga 


gaattagatc 


60 


gatgggaaaa 


aattcggtta 


aggccagggg 


gaaagaaaaa 


atataaatta 


aaacatatag 


120 


tatgggcaag 


cagggagcta 


gaacgactac 


aaccatccct 


tcagacagga 


tcagaagaac 


180 


ttagatcatt 


atataataca 


gtagcaaccc 


tctattgtgt 


gcatcaaagg 


atagagataa 


240 


aagacaccaa 


ggaagcttta 


gacaagatag 


aggaagagca 


aaacaaaagt 


aagaaaaaag 


300 


cacagcaagc 


agcagctgac 


acaggacaca 


Qcacrtcacrcrt 

zj zj ^^^zjzj 


cagccaaaat 


taccctatag 


360 


tgcagaacat 


ccaggggcaa 


atggtacatc 


aggccatatc 


acctagaact 


ttaaatgcat 


420 


gggtaaaagt 


agtagaagag 


aaggctttca 


gcccagaagt 


aatacccatg 


ttttcagcat 


480 


tatcagaagg 


agccacccca 


caagatttaa 


acaccatgct 


aaacacagtg 


gggggacatc 


540 


aagcagccat 


gcaaatgtta 


aaagagacca 


tcaatgagga 


agctgcagaa 


tgggatagag 


600 


tacatccagt 


gcatgcaggg 


cctattgcac 


caggccagat 


gagagaacca 


aggggaagtg 


660 


acatagcagg 


aactactagt 


acccttcagg 


aacaaatagg 


atggatgaca 


aataatccac 


720 



ctatcccagt 


aggagaaatt 


tataaaagat 


ggataatcct 


gggattaaat 


aaaatagtaa 


780 


gaatgtatag 


ccctaccagc 


attctggaca 


taagacaagg 


accaaaagaa 


ccttttagag 


840 


actatgtaga 


ccggttctat 


aaaactctaa 


gagccgagca 


agcttcacag 


gaggtaaaaa 


900 


attggatgac 


agaaaccttg 


ttggtccaaa 


atgcgaaccc 


agattgtaag 


actattttaa 


960 


aagcattggg 


accagcggct 


acactagaag 


aaatgatgac 


agcatgtcag 


ggagtaggag 


1020 


gacccggcca 


taaggcaaga 


gttttggctg 


aagcaatgag 


ccaagtaaca 


aatacagcta 


1080 


ccataatgat 


gcagagaggc 


aattttagga 


accaaagaaa 


gatggttaag 


tgtttcaatt 


1140 


gtggcaaaga 


agggcacaca 


gccagaaatt 


gcagggcccc 


taggaaaaag 


ggctgttgga 


1200 


aatgtggaaa 


ggaaggacac 


caaatgaaag 


attgtactga 


gagacaggct 


aattttttag 


1260 


ggaagatctg 


gccttcctac 


aagggaaggc 


cagggaattt 


tcttcagagc 


agaccagagc 


1320 


caacagcccc 


accatttctt 


cagagcagac 


cagagccaac 


agccccacca 


gaagagagct 


1380 


tcaggtctgg 


ggtagagaca 


acaactcccc 


ctcagaagca 


ggagccgata 


gacaaggaac 


1440 


tgtatccttt 


aacttccctc 


agatcactct 


ttggcaacga 


cccctcgtca 


caataa 


1496 



<210> 18 

O <211> 492 

W <212> PRT 

HI <213> Human immunodeficiency virus type 1 

L <400> 18 

Met Gly Ala Arg Ala Ser Val Leu Ser Gly Gly Glu Leu Asp Arg Trp 
0 1 5 10 15 

IT? Glu Lys lie Arg Leu Arg Pro Gly Gly Lys Lys Lys Tyr Lys Leu Lys 

r ~ ~ 20 ~ " 25 30 



His lie Val Trp Ala Ser Arg Glu Leu Glu Arg Leu Gin Pro Ser Leu 
35 40 45 



Gin Thr Gly Ser Glu Glu Leu Arg Ser Leu Tyr Asn Thr Val Ala Thr 
50 55 60 



Leu Tyr Cys Val His Gin Arg lie Glu lie Lys Asp Thr Lys Glu Ala 
65 70 75 80 



Leu Asp Lys lie Glu Glu Glu Gin Asn Lys Ser Lys Lys Lys Ala Gin 
85 90 95 



Gin Ala Ala Ala Asp Thr Gly His Ser Ser Gin Val Ser Gin Asn Tyr 
100 105 110 



Pro He Val Gin Asn He Gin Gly Gin Met Val His Gin Ala He Ser 
115 120 125 



Pro Arg Thr Leu Asn Ala Trp Val Lys Val Val Glu Glu Lys Ala Phe 
13 0 135 140 



Ser Pro Glu Val He Pro Met Phe Ser Ala Leu Ser Glu Gly Ala Thr 
145 150 155 " 160 



Pro Gin Asp Leu Asn Thr Met Leu Asn Thr Val Gly Gly His Gin Ala 
165 170 175 



Ala Met Gin Met Leu Lys Glu Thr He Asn Glu Glu Ala Ala Glu Trp 
180 185 190 



Asp Arg Val His Pro Val His Ala Gly Pro He Ala Pro Gly Gin Met 
195 200 205 



Arg Glu Pro Arg Gly Ser Asp He Ala Gly Thr Thr Ser Thr Leu Gin 
210 215 220 



Glu Gin He Gly Trp Met Thr Asn Asn Pro Pro He Pro Val Gly Glu 
225 230 235 240 



He Tyr Lys Arg Trp He He Leu Gly Leu Asn Lys He Val Arg Met 
245 250 255 



Tyr Ser Pro Thr Ser He Leu Asp He Arg Gin Gly Pro Lys Glu Pro 
260 265 270 



Phe Arg Asp Tyr Val Asp Arg Phe Tyr Lys Thr Leu Arg Ala Glu Gin 
275 * 280 285 



Ala Ser Gin Glu Val Lys Asn Trp Met Thr Glu Thr Leu Leu Val Gin 
290 295 300 



Asn Ala Asn Pro Asp Cys Lys Thr He Leu Lys Ala Leu Gly Pro Ala 
305 310 315 320 



Ala Thr Leu Glu Glu Met Met Thr Ala Cys Gin Gly Val Gly Gly Pro 
325 330 335 



Gly His Lys Ala Arg Val Leu Ala Glu Ala Met Ser Gin Val Thr Asn 
340 345 350 



Thr Ala Thr lie Met Met Gin Arg Gly Asn Phe Arg Asn Gin Arg Lys 
355 360 " 365 



Met Val Lys Cys Phe Asn Cys Gly Lys Glu Gly His Thr Ala Arg Asn 
370 375 380 



Cys Arg Ala Pro Arg Lys Lys Gly Cys Trp Lys Cys Gly Lys Glu Gly 
385 390 395 400 



His Gin Met Lys Asp Cys Thr Glu Arg Gin Ala Asn Phe Leu Gly Lys 
405 410 415 



lie Trp Pro Ser Tyr Lys Gly Arg Pro Gly Asn Phe Leu Gin Ser Arg 
420 ~ ~ 425 430 

Pro Glu Pro Thr Ala Pro Pro Phe Leu Gin Ser Arg Pro Glu Pro Thr 
435 440 445 

p Ala Pro Pro Glu Glu Ser Phe Arg Ser Gly Val Glu Thr Thr Thr Pro 

SI 450 455 460 

1 ':; i 

Pro Gin Lys Gin Glu Pro He Asp Lys Glu Leu Tyr Pro Leu Thr Ser 
Uf 465 470 475 480 



n 



Leu Arg Ser Leu Phe Gly Asn Asp Pro Ser Ser Gin 







485 




490 








<210> 
<211> 
<212> 
<213> 


19 

2941 
DNA 

Artificial sequence 










<220> 
<223> 


Modified Env from HIV strain pNL4-3 








<400> 19 
gaattctgca 


acaactgctg 


tttatccatt 


tcagaattgg 


gtgtcgacat 


agcagaatag 


60 


gcgttactcg 


acagaggaga 


gcaagaaatg 


gagccagtag 


atcctagact 


agagccctgg 


120 


aagcatccag 


gaagtcagcc 


taaaactgct 


tgtaccaatt 


gctattgtaa 


aaagtgttgc 


180 


tttcattgcc 


aagtttgttt 


catgacaaaa 


gccttaggca 


tctcctatgg 


caggaagaag 


240 


cggagacagc 


gacgaagagc 


tcatcagaac 


agtcagactc 


atcaagcttc 


tctatcaaag 


300 


cagtaagtag 


tacatgtaat 


gcaacctata 


atagtagcaa 


tagtagcatt 


agtagtagca 


360 


ataataatag 


caatagttgt 


gtggtccata 


gtaatcatag 


aatataggaa 


aatattaaga 


420 


caaagaaaaa 


tagacaggtt 


aattgataga 


ctaatagaaa 


gagcagaaga 


cagtggcaat 


480 



gagagtgaag 


gagaagtatc 


agcacttgtg 


gagatggggg 


tggaaatggg 


gcaccatgct 


540 


ccttgggata 


ttgatgatct 


gtagtgctac 


agaaaaattg 


tgggtcacag 


tctattatgg 


600 


ggtacctgtg 


tggaaggaag 


caaccaccac 


tctattttgt 


gcatcagatg 


ctaaagcata 


660 


tgatacagag 


gtacataatg 


tttgggccac 


acatgcctgt 


gtacccacag 


accccaaccc 


720 


acaagaagta 


gtattggtaa 


atgtgacaga 


aaattttaac 


atgtggaaaa 


atgacatggt 


780 


agaacagatg 


catgaggata 


taatcagttt 


atgggatcaa 


agcctaaagc 


catgtgtaaa 


840 


attaacccca 


ctctgtgtta 


gtttaaagtg 


cactgatttg 


aagaatgata 


ctaataccaa 


900 


tagtagtagc 


gggagaatga 


taatggagaa 


aggagagata 


aaaaactgct 


ctttcaatat 


960 


cagcacaagc 


ataagagata 


aggtgcagaa 


agaatatgca 


ttcttttata 


aacttgatat 


1020 


agtaccaata 


gataatacca 


gctataggtt 


gataagttgt 


aacacctcag 


tcattacaca 


1080 


ggcctgtcca 


aaggtatcct 


ttgagccaat 


tcccatacat 


tattgtgccc 


cggctggttt 


1140 


tgcgattcta 


aaatgtaata 


ataagacgtt 


caatggaaca 


ggaccatgta 


caaatgtcag 


1200 


cacagtacaa 


tgtacacatg 


gaatcaggcc 


agtagtatca 


actcaactgc 


tgttaaatgg 


1260 


cagtctagca 


gaagaagatg 


tagtaattag 


atctgccaat 


ttcacagaca 


atgctaaaac 


1320 


cataatagta 


cagctgaaca 


catctgtaga 


aattaattgt 


acaagaccca 


acaacaatac 


1380 


aagaaaaagt 


atccgtatcc 


agaggggacc 


agggagagca 


tttgttacaa 


taggaaaaat 


1440 


aggaaatatg 


agacaagcac 


attgtaacat 


tagtagagca 


aaatggaatg 


ccactttaaa 


1500 


acagatagct 


agcaaattaa 


gagaacaatt 


tggaaataat 


aaaacaataa 


tctttaagca 


1560 


atcctcagga 


ggggacccag 


aaattgtaac 


gcacagtttt 


aattgtggag 


gggaattttt 


1620 


ctactgtaat 


tcaacacaac 


tgtttaatag 


tacttggttt 


aatagtactt 


ggagtactga 


1680 


agggtcaaat 


aacactgaag 


gaagtgacac 


aatcacactc 


ccatgcagaa 


taaaacaatt 


1740 


tataaacatg 


tggcaggaag 


taggaaaagc 


aatgtatgcc 


cctcccatca 


gtggacaaat 


1800 


tagatgttca 


tcaaatatta 


ctgggctgct 


attaacaaga 


gatggtggta 


ataacaacaa 


1860 


tgggtccgag 


atcttcagac 


ctggaggagg 


cgatatgagg 


gacaattgga 


gaagtgaatt 


1920 


atataaatat 


aaagtagtaa 


aaattgaacc 


attaggagta 


gcacccacca 


aggcaaagag 


1980 


aagagtggtg 


cagactagtg 


cagtgggaat 


aggagctttg 


ttccttgggt 


tcttgggagc 


2040 


agcaggaagc 


actatgggct 


gcacgtcaat 


gacgctgacg 


gtacaggcca 


gacaattatt 


2100 


gtctgatata 


gtgcagcagc 


agaacaattt 


gctgagggct 


attgaggcgc 


aacagcatct 


2160 


gttgcaactc 


acagtctggg 


gcatcaaaca 


gctccaggca 


agaatcctgg 


ctgtggaaag 


2220 


atacctaaag 


gatcaacagc 


tcctggggat 


ttggggttgc 


tctggaaaac 


tcatttgcac 


2280 





cactgctgtg ccttggaatg 


ctagttggag 




catgacctgg atggagtggg 


acagagaaat 




aattgaagaa 


tcgcaaaacc 


agcaagaaaa 




atgggcaagt 


ttgtggaatt 


ggtttaacat 




cataatgata 


gtaggaggct 


tggtaggttt 




gaatagagtt 


aggcagggat 


attcaccatt 




gggacccgac 


aggcccgaag 


gaatagaaga 




cattcgatta gtgaacggat 


ccttagcact 




cttcagctac 


caccgcttga 


gagacttact 




gggacgcagg 


gggtgggaag 


ccctcaaata 




ggaactaaag 
a 


aatagtgctg 


ttaacttgct 


o 

Li 

yy 
O 
W 

in 


<210> 20 
<211> 2746 
<212> DNA 

<213> Artificial sequence 


<220> 

<223> Modified Env/Tat/Rev from 




<400> 20 
gaattctgca 


acaactgctg 


tttatccatt 


H 

: ; 


gcgttactcg 


acagaggaga 


gcaagaaatg 




aagcatccag 


gaagtcagcc 


taaaactgct 


C: 


tttcattgcc 


aagtttgttt 


catgacaaaa 




cggagacagc 


gacgaagagc 


tcatcagaac 




cagtaagtag 


tacatgtaat 


gcaacctata 




ataataatag 


caatagttgt 


gtggtccata 




caaagaaaaa 


tagacaggtt 


aattgataga 




gagagtgaag 


gagaagtatc 


agcacttgtg 




ccttgggata 


ttgatgatct 


gtagtgctac 




ggtacctgtg 


tggaaggaag 


caaccaccac 




tgatacagag 


gtacataatg 


tttgggccac 




acaagaagta 


gtattggtaa 


atgtgacaga 




agaacagatg 


catgaggata 


taatcagttt 



taataaatct ctggaacaga tttggaataa 2 34 0 

taacaattac acaagcttaa tacactcctt 2400 

gaatgaacaa gaattattgg aattagataa 2460 

aacaaattgg ctgtggtata taaaattatt 2520 

aagaatagtt tttgctgtac tttctatagt 2580 

atcgtttcag acccacctcc caatcccgag 2640 

agaaggtgga gagagagaca gagacagatc 27 00 

tatctgggac gatctgcgga gcctgtgcct 2760 

cttgattgta acgaggattg tggaacttct 2820 

ttggtggaat ctcctacagt attggagtca 2880 

caatgccaca gccatagcag tagctgagta 2 94 0 

2941 



pNL4-3 

tcagaattgg gtgtcgacat agcagaatag 60 

gagccagtag atcctagact agagccctgg 12 0 

tgtaccaatt gctattgtaa aaagtgttgc 180 

gccttaggca tctcctatgg caggaagaag 240 

agtcagactc atcaagcttc tctatcaaag 300 

atagtagcaa tagtagcatt agtagtagca 360 

gtaatcatag aatataggaa aatattaaga 420 

ctaatagaaa gagcagaaga cagtggcaat 480 

gagatggggg tggaaatggg gcaccatgct 54 0 

agaaaaattg tgggtcacag tctattatgg 600 

tctattttgt gcatcagatg ctaaagcata 660 

acatgcctgt gtacccacag accccaaccc 72 0 

aaattttaac atgtggaaaa atgacatggt 780 

atgggatcaa agcctaaagc catgtgtaaa 840 



attaacccca ctctgtgtta gttgtaacac ctcagtcatt acacaggcct gtccaaaggt 
atcctttgag ccaattccca tacattattg tgccccggct ggttttgcga ttctaaaatg 
taataataag acgttcaatg gaacaggacc atgtacaaat gtcagcacag tacaatgtac 
acatggaatc aggccagtag tatcaactca actgctgtta aatggcagtc tagcagaaga 
agatgtagta attagatctg ccaatttcac agacaatgct aaaaccataa tagtacagct 
gaacacatct gtagaaatta attgtacaag acccaacaac aatacaagaa aaagtatccg 
tatccagagg ggaccaggga gagcatttgt tacaatagga aaaataggaa atatgagaca 
agcacattgt aacattagta gagcaaaatg gaatgccact ttaaaacaga tagctagcaa 
attaagagaa caatttggaa ataataaaac aataatcttt aagcaatcct caggagggga 
cccagaaatt gtaacgcaca gttttaattg tggaggggaa tttttctact gtaattcaac 
acaactgttt aatagtactt ggtttaatag tacttggagt actgaagggt caaataacac 
tgaaggaagt gacacaatca cactcccatg cagaataaaa caatttataa acatgtggca 
ggaagtagga aaagcaatgt atgcccctcc catcagtgga caaattagat gttcatcaaa 
tattactggg ctgctattaa caagagatgg tggtaataac aacaatgggt ccgagatctt 
cagacctgga ggaggcgata tgagggacaa ttggagaagt gaattatata aatataaagt 
agtaaaaatt gaaccattag gagtagcacc caccaaggca aagagaagag tggtgcagac 
tagtgcagtg ggaataggag ctttgttcct tgggttcttg ggagcagcag gaagcactat 
gggctgcacg tcaatgacgc tgacggtaca ggccagacaa ttattgtctg atatagtgca 
gcagcagaac aatttgctga gggctattga ggcgcaacag catctgttgc aactcacagt 
ctggggcatc aaacagctcc aggcaagaat cctggctgtg gaaagatacc taaaggatca 
acagctcctg gggatttggg gttgctctgg aaaactcatt tgcaccactg ctgtgccttg 
gaatgctagt tggagtaata aatctctgga acagatttgg aataacatga cctggatgga 
gtgggacaga gaaattaaca attacacaag cttaatacac tccttaattg aagaatcgca 
aaaccagcaa gaaaagaatg aacaagaatt attggaatta gataaatggg caagtttgtg 
gaattggttt aacataacaa attggctgtg gtatataaaa ttattcataa tgatagtagg 
aggcttggta ggtttaagaa tagtttttgc tgtactttct atagtgaata gagttaggca 
gggatattca ccattatcgt ttcagaccca cctcccaatc ccgaggggac ccgacaggcc 
cgaaggaata gaagaagaag gtggagagag agacagagac agatccattc gattagtgaa 
cggatcctta gcacttatct gggacgatct gcggagcctg tgcctcttca gctaccaccg 
cttgagagac ttactcttga ttgtaacgag gattgtggaa cttctgggac gcagggggtg 





ggaagccctc 


aaatattggt 


ggaatctcct 


acagtattgg 


agtcaggaac 


taaagaatag 


2700 




tgctgttaac 


ttgctcaatg 


ccacagccat 


agcagtagct 


gagtaa 




2746 




<210> 21 
<211> 3417 
<212> DNA 

<213> Artificial sequence 












<220> 

<223> Modified Env/Tat/Rev/Nef from strain 


BH10 








<400> 21 
gaattctgca 


acaactgctg 


tttatccatt 


ttcagaattg 


ggtgtcgaca 


tagcagaata 


60 




ggcgttactc 


gacagaggag 


agcaagaaat 


ggagccagta 


gatcctagac 


tagagccctg 


120 




gaagcatcca 


ggaagtcagc 


ctaaaactgc 


ttgtaccaat 


tgctattgta aaaagtgttg 


180 




ctttcattgc 


caagtttgtt 


tcataacaaa 


agccttaggc 


atctcctatg gcaggaagaa 


240 




gcggagacag 


cgacgaagac 


ctcctcaagg 


cagtcagact 


catcaagttt 


ctctatcaaa 


300 




gcagtaagta 


gtacatgtaa 


tgcaacctat 


acaaatagca 


atagtagcat 


tagtagtagc 


360 




aataataata 


gcaatagttg 


tgtggtccat 


agtaatcata 


gaatatagga 


aaatattaag 


420 




acaaagaaaa 


atagacaggt 


taattgatag 


actaatagaa 


agagcagaag 


acagtggcaa 


480 


w 


tgagagtgaa 


ggagaaatat 


cagcacttgt 


ggagatgggg 


gtggagatgg 


ggcaccatgc 


540 




tccttgggat 


gttgatgatc 


tgtagtgcta 


cagaaaaatt 


gtgggtcaca gtctattatg 


600 


M 


gggtacctgt 


gtggaaggaa 


gcaaccacca 


ctctattttg 


tgcatcagat 


gctaaagcat 


660 




atgatacaga 


ggtacataat 


gtttgggcca 


cacatgcctg 


tgtacccaca gaccccaacc 


720 


cacaagaagt 


agtattggta 


aatgtgacag 


aaaattttaa 


catgtggaaa 


aatgacatgg 


780 




tagaacagat 


gcatgaggat 


ataatcagtt 


tatgggatca 


aagcctaaag 


ccatgtgtaa 


840 




aattaacccc 


actctgtgtt 


agtttaaagt 


gcactgattt 


gaagaatgat 


actaatacca 


900 




atagtagtag 


cgggagaatg 


ataatggaga 


aaggagagat 


aaaaaactgc 


tctttcaata 


960 




tcagcacaag 


cataagaggt 


aaggtgcaga 


aagaatatgc 


atttttttat 


aaacttgata 


1020 




taataccaat 


agataatgat 


actaccagct 


atacgttgac 


aagttgtaac 


acctcagtca 


1080 




ttacacaggc 


ctgtccaaag 


gtatcctttg 


agccaattcc 


catacattat 


tgtgccccgg 


1140 




ctggttttgc 


gattctaaaa 


tgtaataata 


agacgttcaa 


tggaacagga 


ccatgtacaa 


1200 




atgtcagcac 


agtacaatgt 


acacatggaa 


ttaggccagt 


agtatcaact 


caactgctgt 


±z bU 




taaatggcag 


tctggcagaa 


gaagaggtag 


taattagatc 


tgccaatttc 


acagacaatg 


1320 




ctaaaaccat 


aatagtacag 


ctgaaccaat 


ctgtagaaat 


taattgtaca 


agacccaaca 


1380 




acaatacaag 


aaaaagtatc 


cgtatccaga 


gaggaccagg 


gagagcattt 


gttacaatag 


1440 



gaaaaatagg aaatatgaga caagcacatt gtaacattag tagagcaaaa tggaataaca 
ctttaaaaca gatagatagc aaattaagag aacaatttgg aaataataaa acaataatct 
ttaagcagtc ctcaggaggg gacccagaaa ttgtaacgca cagttttaat tgtggagggg 
aatttttcta ctgtaattca acacaactgt ttaatagtac ttggtttaat agtacttgga 
gtactaaagg gtcaaataac actgaaggaa gtgacacaat caccctccca tgcagaataa 
aacaaattat aaacatgtgg caggaagtag gaaaagcaat gtatgcccct cccatcagtg 
gacaaattag atgttcatca aatattacag ggctgctatt aacaagagat ggtggtaata 
gcaacaatga gtccgagatc ttcagacctg gaggaggaga tatgagggac aattggagaa 
gtgaattata taaatataaa gtagtaaaaa ttgaaccatt aggagtagca cccaccaagg 
caaagagaag agtggtgcag actagtgcag tgggaatagg agctttgttc cttgggttct 
tgggagcagc aggaagcact atgggcgcag cgtcaatgac gctgacggta caggccagac 
aattattgtc tggtatagtg cagcagcaga acaatttgct gagggctatt gaggcgcaac 
agcatctgtt gcaactcaca gtctggggca tcaagcagct ccaggcaaga atcctggctg 
tggaaagata cctaaaggat caacagctcc tggggatttg gggttgctct ggaaaactca 
tttgcaccac tgctgtgcct tggaatgcta gttggagtaa taaatctctg gaacagattt 
ggaataacat gacctggatg gagtgggaca gagaaattaa caattacaca agcttaatac 
actccttaat tgaagaatcg caaaaccagc aagaaaagaa tgaacaagaa ttattggaat 
tagataaatg ggcaagtttg tggaattggt ttaacataac aaattggctg tggtatataa 
aattattcat aatgatagta ggaggcttgg taggtttaag aatagttttt gctgtacttt 
ctgtagtgaa tagagttagg cagggatatt caccattatc gtttcagacc cacctcccaa 
tcccgagggg acccgacagg cccgaaggaa tagaagaaga aggtggagag agagacagag 
acagatccat tcgattagtg aacggatcct tagcacttat ctgggacgat ctgcggagcc 
tgtgcctctt cagctaccac cgcttgagag acttactctt gattgtaacg aggattgtgg 
aacttctggg acgcaggggg tgggaagccc tcaaatattg gtggaatctc ctacagtatt 
ggagtcagga gctaaagaat agtgctgtta gcttgctcaa tgccacagct atagcagtag 
ctgaggggac agatagggtt atagaagtag tacaaggagc ttatagagct attcgccaca 
tacctagaag aataagacag ggcttggaaa ggattttgct ataagatggg tggcaagtgg 
tcaaaaagta gtgtggttgg atggcctgct gtaagggaaa gaatgagacg agctgagcca 
gcagcagatg gggtgggagc agcatctcga gacctagaaa aacatggagc aatcacaagt 
agcaacacag cagctaacaa tgctgattgt gcctggctag aagcacaaga ggaggaggag 



gtgggttttc cagtcacacc tcaggtacct ttaagaccaa 
gatcttagcc actttttaaa agaaaagggg ggactggaag 
agacaagata tccttgatct gtggatctac cacacacaag 



tgacttacaa 
ggctaattca 
gctacttccc 



ggcagctgta 
ctcccaacga 
tgattag 



3300 
3360 
3417 



<210> 22 

<211> 2950 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Modified Env/Nef from strain BH10 

<400> 22 





aaattcacca 


c ca tcrcrcracrt 


craacrcracraaa 


tatcacrcact 


tataaaaata 


aaaa t aaaaa 

yyy y *-yy "-y «■ 


60 




taaaacacca 

w « t4. ^ V-/ W. 


tactccttaa 


aatatt crater 


atctataata 


ctacagaaaa 


attgtgggtc 


120 




acagt c tat t 


atcrcrcrata.ee 


tgtgtggaag 


gaagcaacca 


ccactctatt 


ttgtgcatca 


180 




a a t" err* f~ a 3 a cr 


{"•a t a t* era fc a c 

l^ cl y c*. dw 


aaaoafaca t 

Ct^j ^-1 ^ O Ci d, w 


aatatt tQcrcr 


ccacacatac 


ctatataccc 


240 




a u ay a u u u u a 




aa1~aat~a 1 1~a 

ay oay i_a i— Ly 


at a aa tataa 

y i~>aaav~y i— y cl 


paaa aas 1 1 1 


taaca fcataa 

*— dci's.u. »— y »^y y 


3 00 


Q 


dCLCLCtCl 1— y CL U CI 


uyy Lay aaua 


crs ■here a t~as cr 
y a. uy ^a. ay 


crs t" S t" S S t* c S 
y a l~ a i» aa v— v_ ci 


crt" t" t" 3 1" craaa 
y u u ua Lyyy a 


>_ \_- CL CL CL y V_ K^> C CL 


360 






Laaad. L Laau 


uuuau l u Ly u 


y u u ay u l u a a 


ay uyuau Lya 


t* t* t* era a era a 1~ 
l l Ly a ay aa l 


49 0 


I , = 


era t"arhsa1"a 

yauaULaaLa 


ppaa j-arrt" a er 

uuaauay Lay 


f- a crpcrcrcrs na 
uayuyyyaya 


a f~ era faat - rrn 
a ^-y a i— a a ^-y y 


a na a a nrra era 

ciy ctciQ.^y cl ^-j cl 


era faaaaaap 

y V— CL CL CL CL CL CL ^ 


480 






a u a uu ay u av_* 


S a rfPP faa ers 
a cl y >^ cl i_. ci. ay a 


yy Laay y i_y u 


a <"ia a acrs at*a 

a^j aaa^j a a v— a 


tacatttttt 

C ^wj CL C- C O C* C L^ 


540 


M 

z .. 


L a L a. da u L L-y 


a La L ad L aUU 


ad Lay a Lad L 


na i~ apt - a ppa 
yduaULdUUd 


yuuauauy ll 


y a u ciay u uy u 


600 




a Ti r*t ^ f* \~ /~t ^ 

aauauu UCay 


LUdL Uauaua 


y y c u uy t. u u a 


day y Let UUu U 


hfrranppa at* 
l Ly ay uuaau 


hpppa t*a pat" 
LuuuauauaL 






tattgtgccc 


cggctggttt 


tgegattcta 


aaatgtaata 


ataagaegtt 


caatggaaca 


720 




ggaccatgta 


caaatgtcag 


cacagtacaa 


tgtacacatg 


gaattaggee 


agtagtatca 


780 




actcaactgc 


tgttaaatgg 


cagtctggca 


gaagaagagg 


tagtaattag 


atetgecaat 


840 




ttcacagaca 


atgetaaaac 


cataatagta 


cagctgaacc 


aatctgtaga 


aattaattgt 


900 




acaagaccca 


acaacaatac 


aagaaaaagt 


atccgtatcc 


agagaggacc 


agggagagca 


960 




tttgttacaa 


taggaaaaat 


aggaaatatg 


agacaagcac 


attgtaacat 


tagtagagca 


1020 




aaatggaata 


acactttaaa 


acagatagat 


agcaaattaa 


gagaacaatt 


tggaaataat 


1080 




aaaacaataa 


tctttaagca 


gtcctcagga 


ggggacccag 


aaattgtaac 


gcacagtttt 


1140 




aattgtggag 


gggaattttt 


ctactgtaat 


tcaacacaac 


tgtttaatag 


tacttggttt 


1200 




aatagtactt 


ggagtactaa 


agggtcaaat 


aacactgaag 


gaagtgacac 


aatcaccctc 


1260 




ecatgeagaa 


taaaacaaat 


tataaacatg 


tggcaggaag 


taggaaaagc 


aatgtatgcc 


1320 




cctcccatca 


gtggacaaat 


tagatgttca 


tcaaatatta 


cagggctget 


attaacaaga 


1380 





gatggtggta 


atagcaacaa 


tgagtccgag 


atcttcagac 


ctggaggagg 


agatatgagg 


1440 




gacaattgga 


gaagtgaatt 


atataaatat 


aaagtagtaa 


aaattgaacc 


attaggagta 


1500 




gcacccacca 


aggcaaagag 


aagagtggtg 


cagactagtg 


cagtgggaat 


aggagctttg 


1560 




ttccttgggt 


tcttgggagc 


agcaggaagc 


actatgggcg 


cagcgtcaat 


gacgctgacg 


1620 




gtacaggcca 


gacaattatt 


gtctggtata 


gtgcagcagc 


agaacaattt 


gctgagggct 


1680 




attgaggcgc 


aacagcatct 


gttgcaactc 


acagtctggg 


gcatcaagca 


gctccaggca 


1740 




agaatcctgg 


ctgtggaaag 


atacctaaag 


gatcaacagc 


tcctggggat 


ttggggttgc 


1800 




tctggaaaac 


tcatttgcac 


cactgctgtg 


ccttggaatg 


ctagttggag 


taataaatct 


1860 




ctggaacaga 


tttggaataa 


catgacctgg 


atggagtggg 


acagagaaat 


taacaattac 


1920 




acaagcttaa 


tacactcctt 


aattgaagaa 


tcgcaaaacc 


agcaagaaaa 


gaatgaacaa 


1980 




gaattattgg 


aattagataa 


atgggcaagt 


ttgtggaatt 


ggtttaacat 


aacaaattgg 


2040 




ctgtggtata 


taaaattatt 


cataatgata 


gtaggaggct 


tggtaggttt 


aagaatagtt 


2100 


y 


tttgctgtac 


tttctgtagt 


gaatagagtt 


aggcagggat 


attcaccatt 


atcgtttcag 


2160 




acccacctcc 


caatcccgag 


gggacccgac 


aggcccgaag 


gaatagaaga 


agaaggtgga 


2220 


fii 


gagagagaca 


gagacagatc 


cattcgatta 


gtgaacggat 


ccttagcact 


tatctgggac 


2280 


m 


gatctgcgga 


gcctgtgcct 


cttcagctac 


caccgcttga 


gagacttact 


cttgattgta 


2340 


s 


acgaggattg 


tggaacttct 


gggacgcagg 


gggtgggaag 


ccctcaaata 


ttggtggaat 


2400 


gssss 


ctcctacagt 


attggagtca 


ggagctaaag 


aatagtgctg 


ttagcttgct 


caatgccaca 


2460 


gctatagcag 


tagctgaggg 


gacagatagg 


gttatagaag 


tagtacaagg 


agcttataga 


2520 


D 


gctattcgcc 


acatacctag 


aagaataaga 


cagggcttgg 


aaaggatttt 


gctataagat 


2580 




gggtggcaag 


tggtcaaaaa 


gtagtgtggt 


tggatggcct 


gctgtaaggg 


aaagaatgag 


2640 




acgagctgag 


ccagcagcag 


atggggtggg 


agcagcatct 


cgagacctag 


aaaaacatgg 


2700 




agcaatcaca 


agtagcaaca 


cagcagctaa 


caatgctgat 


tgtgcctggc 


tagaagcaca 


2760 




agaggaggag 


gaggtgggtt 


ttccagtcac 


acctcaggta 


cctttaagac 


caatgactta 


2820 




caaggcagct 


gtagatctta 


gccacttttt 


aaaagaaaag 


gggggactgg 


aagggctaat 


2880 




tcactcccaa 


cgaagacaag 


atatccttga 


tctgtggatc 


taccacacac 


aaggctactt 


2940 



ccctgattag 2 950 



<210> 23 

<211> 2747 

<212> DNA 

<213> Artificial sequence 



60 



240 
300 
360 
420 
480 



600 
660 
720 



<220> 

<223> Modified Env/Tat from strain BH10 
<400> 23 

gaattctgca acaactgctg tttatccatt ttcagaattg ggtgtcgaca tagcagaata 
ggcgttactc gacagaggag agcaagaaat ggagccagta gatcctagac tagagccctg 12 0 
gaagcatcca ggaagtcagc ctaaaactgc ttgtaccaat tgctattgta aaaagtgttg 180 
ctttcattgc caagtttgtt tcataacaaa agccttaggc atctcctatg gcaggaagaa 
gcggagacag cgacgaagac ctcctcaagg cagtcagact catcaagttt ctctatcaaa 
gcagtaagta gtacatgtaa tgcaacctat acaaatagca atagtagcat tagtagtagc 
aataataata gcaatagttg tgtggtccat agtaatcata gaatatagga aaatattaag 
acaaagaaaa atagacaggt taattgatag actaatagaa agagcagaag acagtggcaa 
tgagagtgaa ggagaaatat cagcacttgt ggagatgggg gtggagatgg ggcaccatgc 54 0 
tccttgggat gttgatgatc tgtagtgcta cagaaaaatt gtgggtcaca gtctattatg 
n gggtacctgt gtggaaggaa gcaaccacca ctctattttg tgcatcagat gctaaagcat 

zz atgatacaga ggtacataat gtttgggcca cacatgcctg tgtacccaca gaccccaacc 

W cacaagaagt agtattggta aatgtgacag aaaattttaa catgtggaaa aatgacatgg 7 80 

Q 

y tagaacagat gcatgaggat ataatcagtt tatgggatca aagcctaaag ccatgtgtaa 840 

111 aattaacccc actctgtgtt agtttaaagt gcactgattt gaagaatgat actaatacca 900 

s 

H atagtagtag cgggagaatg ataatggaga aaggagagat aaaaaactgc tctttcaata 960 

O tcagcacaag cataagaggt aaggtgcaga aagaatatgc atttttttat aaacttgata 1020 

% taataccaat agataatgat actaccagct atacgttgac aagttgtaac acctcagtca 1080 

H : ttacacaggc ctgtccaaag gtatcctttg agccaattcc catacattat tgtgccccgg 114 0 

ctggttttgc gattctaaaa tgtaataata agacgttcaa tggaacagga ccatgtacaa 1200 

atgtcagcac agtacaatgt acacatggaa ttaggccagt agtatcaact caactgctgt 1260 

taaatggcag tctggcagaa gaagaggtag taattagatc tgccaatttc acagacaatg 132 0 

ctaaaaccat aatagtacag ctgaaccaat ctgtagaaat taattgtaca agacccaaca 13 8 0 

acaatacaag aaaaagtatc cgtatccaga gaggaccagg gagagcattt gttacaatag 1440 

gaaaaatagg aaatatgaga caagcacatt gtaacattag tagagcaaaa tggaataaca 1500 

ctttaaaaca gatagatagc aaattaagag aacaatttgg aaataataaa acaataatct 1560 

ttaagcagtc ctcaggaggg gacccagaaa ttgtaacgca cagttttaat tgtggagggg 1620 

aatttttcta ctgtaattca acacaactgt ttaatagtac ttggtttaat agtacttgga 1680 

gtactaaagg gtcaaataac actgaaggaa gtgacacaat caccctccca tgcagaataa 1740 



hi 



aacaaattat aaacatgtgg caggaagtag gaaaagcaat gtatgcccct cccatcagtg 180 0 
gacaaattag atgttcatca aatattacag ggctgctatt aacaagagat ggtggtaata 1860 
gcaacaatga gtccgagatc ttcagacctg gaggaggaga tatgagggac aattggagaa 
gtgaattata taaatataaa gtagtaaaaa ttgaaccatt aggagtagca cccaccaagg 
caaagagaag agtggtgcag actagtgcag tgggaatagg agctttgttc cttgggttct 2 040 
tgggagcagc aggaagcact atgggcgcag cgtcaatgac gctgacggta caggccagac 
aattattgtc tggtatagtg cagcagcaga acaatttgct gagggctatt gaggcgcaac 
agcatctgtt gcaactcaca gtctggggca tcaagcagct ccaggcaaga atcctggctg 
tggaaagata cctaaaggat caacagctcc tggggatttg gggttgctct ggaaaactca 
tttgcaccac tgctgtgcct tggaatgcta gttggagtaa taaatctctg gaacagattt 2340 
ggaataacat gacctggatg gagtgggaca gagaaattaa caattacaca agcttaatac 
actccttaat tgaagaatcg caaaaccagc aagaaaagaa tgaacaagaa ttattggaat 
tagataaatg ggcaagtttg tggaattggt ttaacataac aaattggctg tggtatataa 
aattattcat aatgatagta ggaggcttgg taggtttaag aatagttttt gctgtacttt 
ctgtagtgaa tagagttagg cagggatatt caccattatc gtttcagacc cacctcccaa 
tcccgagggg acccgacagg cccgaaggaa tagaagaaga aggtggagag agagacagag 
acagatccat tcgattagtg aacggatcct tagcacttat ctggtaa 



Q <210> 24 

U <211> 2583 



<212> DNA 

<213> Artificial sequence 
<220> 

<223> Modified Env 
<400> 24 

gaattcgcca ccatgggagt gaaggagaaa tatcagcact tgtggagatg ggggtggaga 
tggggcacca tgctccttgg gatgttgatg atctgtagtg ctacagaaaa attgtgggtc 
acagtctatt atggggtacc tgtgtggaag gaagcaacca ccactctatt ttgtgcatca 
gatgctaaag catatgatac agaggtacat aatgtttggg ccacacatgc ctgtgtaccc 
acagacccca acccacaaga agtagtattg gtaaatgtga cagaaaattt taacatgtgg 
aaaaatgaca tggtagaaca gatgcatgag gatataatca gtttatggga tcaaagccta 
aagccatgtg taaaattaac cccactctgt gttagtttaa agtgcactga tttgaagaat 
gatactaata ccaatagtag tagcgggaga atgataatgg agaaaggaga gataaaaaac 



1920 
1980 



2100 
2160 
2220 
2280 



2400 
2460 
2520 
2580 
2640 
2700 
2747 



60 
120 
180 
240 
300 
360 
420 
480 



r 



tgctctttca atatcagcac aagcataaga ggtaaggtgc agaaagaata tgcatttttt 540 
tataaacttg atataatacc aatagataat gatactacca gctatacgtt gacaagttgt 600 
aacacctcag tcattacaca ggcctgtcca aaggtatcct ttgagccaat tcccatacat 
tattgtgccc cggctggttt tgcgattcta aaatgtaata ataagacgtt caatggaaca 
ggaccatgta caaatgtcag cacagtacaa tgtacacatg gaattaggcc agtagtatca 
actcaactgc tgttaaatgg cagtctggca gaagaagagg tagtaattag atctgccaat 
ttcacagaca atgctaaaac cataatagta cagctgaacc aatctgtaga aattaattgt 
acaagaccca acaacaatac aagaaaaagt atccgtatcc agagaggacc agggagagca 
tttgttacaa taggaaaaat aggaaatatg agacaagcac attgtaacat tagtagagca 
aaatggaata acactttaaa acagatagat agcaaattaa gagaacaatt tggaaataat 
aaaacaataa tctttaagca gtcctcagga ggggacccag aaattgtaac gcacagtttt 
aattgtggag gggaattttt ctactgtaat tcaacacaac tgtttaatag tacttggttt 
aatagtactt ggagtactaa agggtcaaat aacactgaag gaagtgacac aatcaccctc 
ccatgcagaa taaaacaaat tataaacatg tggcaggaag taggaaaagc aatgtatgcc 
W cctcccatca gtggacaaat tagatgttca tcaaatatta cagggctgct attaacaaga 1380 

gatggtggta atagcaacaa tgagtccgag atcttcagac ctggaggagg agatatgagg 
f " gacaattgga gaagtgaatt atataaatat aaagtagtaa aaattgaacc attaggagta 

gcacccacca aggcaaagag aagagtggtg cagagagaaa aaagagcagt gggaatagga 
0 gctttgttcc ttgggttctt gggagcagca ggaagcacta tgggcgcagc gtcaatgacg 

ctgacggtac aggccagaca attattgtct ggtatagtgc agcagcagaa caatttgctg 
agggctattg aggcgcaaca gcatctgttg caactcacag tctggggcat caagcagctc 
caggcaagaa tcctggctgt ggaaagatac ctaaaggatc aacagctcct ggggatttgg 
ggttgctctg gaaaactcat ttgcaccact gctgtgcctt ggaatgctag ttggagtaat 
aaatctctgg aacagatttg gaataacatg acctggatgg agtgggacag agaaattaac 
aattacacaa gcttaataca ctccttaatt gaagaatcgc aaaaccagca agaaaagaat 
gaacaagaat tattggaatt agataaatgg gcaagtttgt ggaattggtt taacataaca 2 04 0 
aattggctgt ggtatataaa attattcata atgatagtag gaggcttggt aggtttaaga 2100 
atagtttttg ctgtactttc tgtagtgaat agagttaggc agggatattc accattatcg 2160 
tttcagaccc acctcccaat cccgagggga cccgacaggc ccgaaggaat agaagaagaa 2220 
ggtggagaga gagacagaga cagatccatt cgattagtga acggatcctt agcacttatc 
tgggacgatc tgcggagcct gtgcctcttc agctaccacc gcttgagaga cttactcttg 



660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 



1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 



2280 
2340 



attgtaacga ggattgtgga acttctggga cgcagggggt gggaagccct caaatattgg 
tggaatctcc tacagtattg gagtcaggag ctaaagaata gtgctgttag cttgctcaat 
gccacagcta tagcagtagc tgaggggaca gatagggtta tagaagtagt acaaggagct 
tatagagcta ttcgccacat acctagaaga ataagacagg gcttggaaag gattttgcta 
taa 



<210> 25 
<211> 108 
<212> DNA 

<213> Human immunodeficiency virus type 1 
<400> 25 

tgtacaagac ccaacaacaa tacaagaaaa agtatccgta tccagagagg accagggaga 
gcatttgtta caataggaaa aataggaaat atgagacaag cacattgt 

<210> 26 
<211> 105 
<212> DNA 

<213> Human immunodeficiency virus type 1 
<400> 26 

tgtaccagac ctaacaacaa tacaagaaaa agtgtacgta taggaccagg acaaacattc 
tatgcaacag gtgatataat aggggatata agacaagcac attgt 

<210> 27 
<2ll> 105 
<212> DNA 

<213> Human immunodeficiency virus type 1 

tgtacgagac ccaacaataa tacaagaaaa agtataagga taggaccagg acaagcattc 
tatgcaacag gagaaataat aggagatata agacaagcac attgt 



<210> 28 
<211> 102 
<212> DNA 

<213> Human immunodeficiency virus type 1 

Jgcacaaggc cctacaacaa tataagacaa aggaccccca taggactagg gcaagcactc 



tatacaacaa gaagaataga agatataaga agagcacatt gt 



<210> 29 

<211> 105 

<212> DNA 

<213> Human immunodeficiency virus type 1 



<400> 29 

tgtaccagac cctccaccaa tacaagaaca agtatacgta taggaccagg acaagtattc 
tatagaacag gagacataac aggagatata agaaaagcat attgt 



<210> 30 
<211> 105 
<212> DNA 

<213> Human immunodeficiency virus type 1 
<400> 30 

tgtacaagac ccaacaacaa tacaagaaaa agaatatctt taggaccagg acgagtattt 
tatacagcag gagaaataat aggagacatc agaaaggcac attgt 



<210> 31 
<211> 105 
<212> DNA 

<213> Human immunodeficiency virus type 1 
<400> 31 

tgtaccagac ctaataacaa tacaagaaaa agtataactt ttgcaccagg acaagcgctc 
tatgcaacag gtgaaataat aggagatata agacaagcac attgt 



60 
105 



60 
105 



60 
105 



W <210> 32 

Q <211> 2562 

kl <212> DNA 

tk <213> Artificial sequence 

u e 

? <220> 

H : <223> Env with multi-clade V3 loops 

'""^ < 4 0 0 ^ 32 

U atgagagtga aggagaaata tcagcacttg tggagatggg ggtggagatg gggcaccatg 60 

P ctccttggga tgttgatgat ctgtagtgct acagaaaaat tgtgggtcac agtctattat 12 0 

ggggtacctg tgtggaagga agcaaccacc actctatttt gtgcatcaga tgctaaagca 180 
tatgatacag aggtacataa tgtttgggcc acacatgcct gtgtacccac agaccccaac 240 
ccacaagaag tagtattggt aaatgtgaca gaaaatttta acatgtggaa aaatgacatg 300 
gtagaacaga tgcatgagga tataatcagt ttatgggatc aaagcctaaa gccatgtgta 360 
aaattaaccc cactctgtgt tggagctggt agttgtaaca cctcagtcat tacacaggcc 
tgtccaaagg tatcctttga gccaattccc atacattatt gtgccccggc tggttttgcg 
attctaaaat gtaataataa gacgttcaat ggaacaggac catgtacaaa tgtcagcaca 
gtacaatgta cacatggaat taggccagta gtatcaactc aactgctgtt aaatggcagt 
ctggcagaag aagaggtagt aattagatct gccaatttca cagacaatgc taaaaccata 
atagtacagc tgaaccaatc tgtagaaatt aattgtacaa gacccaacaa caatacaaga 



420 
480 
540 
600 
660 
720 



aaaagtatcc gtatccagag aggaccaggg agagcatttg ttacaatagg aaaaatagga 
aatatgagac aagcacattg tctcgggtgt accagaccta acaacaatac aagaaaaagt 
gtacgtatag gaccaggaca aacattctat gcaacaggtg atataatagg ggatataaga 
caagcacatt gttgtacgag acccaacaat aatacaagaa aaagtataag gataggacca 
ggacaagcat tctatgcaac aggagaaata ataggagata taagacaagc acattgttgc 
acaaggccct acaacaatat aagacaaagg acccccatag gactagggca agcactctat 
acaacaagaa gaatagaaga tataagaaga gcacattgtt gtaccagacc ctccaccaat 
acaagaacaa gtatacgtat aggaccagga caagtattct atagaacagg agacataaca 
ggagatataa gaaaagcata ttgtggatcc tgtacaagac ccaacaacaa tacaagaaaa 
agaatatctt taggaccagg acgagtattt tatacagcag gagaaataat aggagacatc 
agaaaggcac attgttgtac cagacctaat aacaatacaa gaaaaagtat aacttttgca 
ccaggacaag cgctctatgc aacaggtgaa ataataggag atataagaca agcacattgt 
ctcgggaaca ttagtagagc aaaatggaat aacactttaa aacagataga tagcaaatta 
agagaacaat ttggaaataa taaaacaata atctttaagc agtcctcagg aggggaccca 
gaaattgtaa cgcacagttt taattgtgga ggggaatttt tctactgtaa ttcaacacaa 
ctgtttaata gtacttggtt taatagtact tggagtacta aagggtcaaa taacactgaa 
ggaagtgaca caatcaccct cccatgcaga ataaaacaaa ttataaacat gtggcaggaa 
gtaggaaaag caatgtatgc ccctcccatc agtggacaaa ttagatgttc atcaaatatt 
acagggctgc tattaacaag agatggtggt aatagcaaca atgagtccga gatcttcaga 
cctggaggag gagatatgag ggacaattgg agaagtgaat tatataaata taaagtagta 
aaaattgaac cattaggagt agcacccacc aaggcaaaga gaagagtggt gcagactagt 
gcagtgggaa taggagcttt gttccttggg ttcttgggag cagcaggaag cactatgggc 
gcagcgtcaa tgacgctgac ggtacaggcc agacaattat tgtctggtat agtgcagcag 
cagaacaatt tgctgagggc tattgaggcg caacagcatc tgttgcaact cacagtctgg 
ggcatcaagc agctccaggc aagaatcctg gctgtggaaa gatacctaaa ggatcaacag 
ctcctgggga tttggggttg ctctggaaaa ctcatttgca ccactgctgt gccttggaat 
gctagttgga gtaataaatc tctggaacag atttggaata acatgacctg gatggagtgg 
gacagagaaa ttaacaatta cacaagctta atacactcct taattgaaga atcgcaaaac 
cagcaagaaa agaatgaaca agaattattg gaattagata aatgggcaag tttgtggaat 
tggtttaaca taacaaattg gctgtggtat ataaaatcgt ggctgctgct gctcctgctc 
tccctctccc tcctccaggc cacggatttc atgtccctgt ga 



<210> 33 
<211> 853 
<212> PRT 

<213> Artificial sequence 
<220> 

<223> Modified Env with multi-clade V3 loops 
<400> 33 

Met Arg Val Lys Glu Lys Tyr Gin His Leu Trp Arg Trp Gly Trp Arg 
15 10 15 



Trp Gly Thr Met Leu Leu Gly Met Leu Met lie Cys Ser Ala Thr Glu 
20 25 30 



Lys Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp Lys Glu Ala 
35 40 45 



Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Asp Thr Glu 
50 55 60 



Val His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn 
65 70 75 80 



Pro Gin Glu Val Val Leu Val Asn Val Thr Glu Asn Phe Asn Met Trp 
85 90 95 



Lys Asn Asp Met Val Glu Gin Met His Glu Asp lie lie Ser Leu Trp 
100 105 110 



Asp Gin Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Gly 
115 120 125 



Ala Gly Ser Cys Asn Thr Ser Val lie Thr Gin Ala Cys Pro Lys Val 
13 0 135 14 0 



Ser Phe Glu Pro lie Pro lie His Tyr Cys Ala Pro Ala Gly Phe Ala 
145 150 155 160 



lie Leu Lys Cys Asn Asn Lys Thr Phe Asn Gly Thr Gly Pro Cys Thr 

165 170 175 



Asn Val Ser Thr Val Gin Cys Thr His Gly lie Arg Pro Val Val Ser 
180 185 190 



Thr Gin Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Val He 



195 



200 



205 



Arg Ser Ala Asn Phe Thr Asp Asn Ala Lys Thr lie lie Val Gin Leu 
210 215 220 



Asn Gin Ser Val Glu lie Asn Cys Thr Arg Pro Asn Asn Asn Thr Arg 
225 230 235 240 



Lys Ser lie Arg lie Gin Arg Gly Pro Gly Arg Ala Phe Val Thr lie 
245 " 250 255 



Gly Lys lie Gly Asn Met Arg Gin Ala His Cys Leu Gly Cys Thr Arg 
260 265 270 



Pro Asn Asn Asn Thr Arg Lys Ser Val Arg lie Gly Pro Gly Gin Thr 

275 280 285 



Phe Tyr Ala Thr Gly Asp He He Gly Asp He Arg Gin Ala His Cys 
290 295 300 



Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys Ser He Arg He Gly Pro 
305 " 310 315 320 



Gly Gin Ala Phe Tyr Ala Thr Gly Glu He He Gly Asp He Arg Gin 
325 330 335 



Ala His Cys Cys Thr Arg Pro Tyr Asn Asn He Arg Gin Arg Thr Pro 
340 345 350 



He Gly Leu Gly Gin Ala Leu Tyr Thr Thr Arg Arg He Glu Asp He 
355 360 365 



Arg Arg Ala His Cys Cys Thr Arg Pro Ser Thr Asn Thr Arg Thr Ser 
370 ' 375 380 



He Arg He Gly Pro Gly Gin Val Phe Tyr Arg Thr Gly Asp He Thr 
385 ~ 390 395 400 



Gly Asp He Arg Lys Ala Tyr Cys Gly Ser Cys Thr Arg Pro Asn Asn 
405 410 415 



Asn Thr Arg Lys Arg He Ser Leu Gly Pro Gly Arg Val Phe Tyr Thr 
420 425 430 



Ala Gly Glu He He Gly Asp He Arg Lys Ala His Cys Cys Thr Arg 
435 440 445 



Pro Asn Asn Asn Thr Arg Lys Ser He Thr Phe Ala Pro Gly Gin Ala 
450 455 460 



Leu Tyr Ala Thr Gly Glu He He Gly Asp He Arg Gin Ala His Cys 
465 470 475 480 

Leu Gly Asn He Ser Arg Ala Lys Trp Asn Asn Thr Leu Lys Gin He 
485 490 495 



Asp Ser Lys Leu Arg Glu Gin Phe Gly Asn Asn Lys Thr He He Phe 
500 505 ~ 510 



Lys Gin Ser Ser Gly Gly Asp Pro Glu He Val Thr His Ser Phe Asn 
515 520 525 



Cys Gly Gly Glu Phe Phe Tyr Cys Asn Ser Thr Gin Leu Phe Asn Ser 
530 535 540 



Thr Trp Phe Asn Ser Thr Trp Ser Thr Lys Gly Ser Asn Asn Thr Glu 
54 5 550 555 560 

Gly Ser Asp Thr He Thr Leu Pro Cys Arg He Lys Gin He He Asn 
565 570 575 



Met Trp Gin Glu Val Gly Lys Ala Met Tyr Ala Pro Pro He Ser Gly 
580 585 590 



Gin He Arg Cys Ser Ser Asn He Thr Gly Leu Leu Leu Thr Arg Asp 
595 600 ~ 605 

Gly Gly Asn Ser Asn Asn Glu Ser Glu He Phe Arg Pro Gly Gly Gly 
610 615 620 



Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val 
625 630 635 ^ 640 

Lys He Glu Pro Leu Gly Val Ala Pro Thr Lys Ala Lys Arg Arg Val 
645 650 655 



Val Gin Thr Ser Ala Val Gly He Gly Ala Leu Phe Leu Gly Phe Leu 
660 665 670 



Gly Ala Ala Gly Ser Thr Met Gly Ala Ala Ser Met Thr Leu Thr Val 
675 680 685 



Gin Ala Arg Gin Leu Leu Ser Gly lie Val Gin Gin Gin Asn Asn Leu 
690 695 700 



yy 



111 



Leu Arg Ala lie Glu Ala Gin Gin His Leu Leu Gin Leu Thr Val Trp 
705 710 715 720 



Gly lie Lys Gin Leu Gin Ala Arg lie Leu Ala Val Glu Arg Tyr Leu 
725 730 735 



Lys Asp Gin Gin Leu Leu Gly lie Trp Gly Cys Ser Gly Lys Leu lie 
74 0 745 4 750 



Cys Thr Thr Ala Val Pro Trp Asn Ala Ser Trp Ser Asn Lys Ser Leu 
755 760 765 



Glu Gin lie Trp Asn Asn Met Thr Trp Met Glu Trp Asp Arg Glu lie 
770 775 780 



Asn Asn Tyr Thr Ser Leu lie His Ser Leu lie Glu Glu Ser Gin Asn 
785 790 795 800 



Gin Gin Glu Lys Asn Glu Gin Glu Leu Leu Glu Leu Asp Lys Trp Ala 
805 810 815 



Ser Leu Trp Asn Trp Phe Asn lie Thr Asn Trp Leu Trp Tyr lie Lys 
82 0 ~ 825 83 0 



Ser Trp Leu Leu Leu Leu Leu Leu Ser Leu Ser Leu Leu Gin Ala Thr 
835 840 845 



Asp Phe Met Ser Leu 
850 



<210> 34 
<211> 1092 
<212> DNA 

<213> Human immunodeficiency virus type 1 
<400> 34 

atgggtgcga gagcgtcagt attaagcggg ggagaattag atcgatggga aaaaattcgg 60 
ttaaggccag ggggaaagaa aaaatataaa ttaaaacata tagtatgggc aagcagggag 12 0 
ctagaacgat tcgcagttaa tcctggcctg ttagaaacat cagaaggctg tagacaaata 180 
ctgggacagc tacaaccatc ccttcagaca ggatcagaag aacttagatc attatataat 240 



acagtagcaa ccctctattg tgtgcatcaa aggatagaga taaaagacac caaggaagct 300 



ttagacaaga 


tagaggaaga 


gcaaaacaaa 


agtaagaaaa aagcacagca agcagcagct 


360 


gacacaggac 


acagcagtca 


ggtcagccaa 


aattacccta tagtgcagaa 


catccagggg 


420 


caaatggtac 


atcaggccat 


atcacctaga 


actttaaatg catgggtaaa 


agtagtagaa 


480 


gagaaggctt 


tcagcccaga 


agtaataccc 


atgttttcag cattatcaga 


aggagccacc 


540 


ccacaagatt 


taaacaccat 


gctaaacaca 


gtggggggac atcaagcagc 


catgcaaatg 


600 


ttaaaagaga 


ccatcaatga 


ggaagctgca 


gaatgggata gagtacatcc 


agtgcatgca 


660 


gggcctattg 


caccaggcca 


gatgagagaa 


ccaaggggaa gtgacatagc 


aggaactact 


720 


agtacccttc 


aggaacaaat 


aggatggatg 


acaaataatc cacctatccc 


agtaggagaa 


780 


atttataaaa 


gatggataat 


cctgggatta 


aataaaatag taagaatgta 


tagccctacc 


840 


agcattctgg 


acataagaca 


aggaccaaaa 


gaacctttta gagactatgt 


agaccggttc 


900 


tataaaactc 


taagagccga 


gcaagcttca 


caggaggtaa aaaattggat 


gacagaaacc 


960 


ttgttggtcc 


aaaatgcgaa 


cccagattgt 


aagactattt taaaagcatt 


gggaccagcg 


1020 


gctacactag 


aagaaatgat 


gacagcatgt 


cagggagtag gaggacccgg 


ccataaggca 


1080 


agagttttgt 


aa 








1092 



£5 <210> 35 

Ifl <211> 1179 

fi <212> DNA 

Ul <213> Human immunodeficiency virus type 1 

~ 

H <400> 35 



atgagagtga 


aggagaaata 


tcagcacttg 


tggagatggg ggtggagatg gggcaccatg 


60 


ctccttggga 


tgttgatgat 


ctgtagtgct 


ggtgcgagag 


cgtcagtatt 


aagcggggga 


120 


gaattagatc 


gatgggaaaa 


aattcggtta 


aggccagggg 


gaaagaaaaa 


atataaatta 


180 


aaacatatag 


tatgggcaag 


cagggagcta 


gaacgattcg cagttaatcc 


tggcctgtta 


240 


gaaacatcag 


aaggctgtag 


acaaatactg 


ggacagctac 


aaccatccct 


tcagacagga 


300 


tcagaagaac 


ttagatcatt 


atataataca 


gtagcaaccc 


tctattgtgt 


gcatcaaagg 


360 


atagagataa 


aagacaccaa 


ggaagcttta 


gacaagatag 


aggaagagca 


aaacaaaagt 


420 


aagaaaaaag 


cacagcaagc 


agcagctgac 


acaggacaca 


gcagtcaggt 


cagccaaaat 


480 


taccctatag 


tgcagaacat 


ccaggggcaa 


atggtacatc 


aggccatatc 


acctagaact 


540 


ttaaatgcat 


gggtaaaagt 


agtagaagag 


aaggctttca 


gcccagaagt 


aatacccatg 


600 


ttttcagcat 


tatcagaagg 


agccacccca 


caagatttaa 


acaccatgct 


aaacacagtg 


660 


gggggacatc 


aagcagccat 


gcaaatgtta 


aaagagacca 


tcaatgagga 


agctgcagaa 


720 


tgggatagag 


tacatccagt 


gcatgcaggg 


cctattgcac 


caggccagat 


gagagaacca 


780 





aggggaagtg 


acatagcagg 


aactactagt 


acccttcagg 


aacaaatagg 


atggatgaca 


840 




aataatccac 


ctatcccagt 


aggagaaatt 


tataaaagat 


ggataatcct 


gggattaaat 


900 




aaaatagtaa 


gaatgtatag 


ccctaccagc 


attctggaca 


taagacaagg 


accaaaagaa 


960 




ccttttagag 


actatgtaga 


ccggttctat 


aaaactctaa 


gagccgagca 


agcttcacag 


1020 




gaggtaaaaa 


attggatgac 


agaaaccttg 


ttggtccaaa 


atgcgaaccc 


agattgtaag 


1080 




actattttaa aagcattggg accagcggct 


acactagaag aaatgatgac 


agcatgtcag 


1140 




ggagtaggag gacccggcca taaggcaaga gttttgtaa 






1179 




<210> 36 
<211> 1308 
<212> DNA 

<213> Human immunodeficiency virus type 1 










<400> 36 
atgagagtga 


aggagaaata 


tcagcacttg 


tggagatggg ggtggagatg gggcaccatg 


60 




ctccttggga 


tgttgatgat 


ctgtagtgct 


ggtgcgagag 


cgtcagtatt 


aagcggggga 


120 


ri 


gaattagatc 


gatgggaaaa 


aattcggtta 


aggccagggg 


gaaagaaaaa 


atataaatta 


180 


J 

bl 


aaacatatag 


tatgggcaag 


cagggagcta 


gaacgattcg 


cagttaatcc 


tggcctgtta 


240 


gaaacatcag 


aaggctgtag 


acaaatactg 


ggacagctac 


aaccatccct 


tcagacagga 


300 


w 


tcagaagaac 


ttagatcatt 


atataataca 


gtagcaaccc 


tctattgtgt 


gcatcaaagg 


360 


r 


atagagataa 


aagacaccaa 


ggaagcttta 


gacaagatag aggaagagca 


aaacaaaagt 


420 




aagaaaaaag 


cacagcaagc 


agcagctgac 


acaggacaca gcagtcaggt 


cagccaaaat 


480 




taccctatag 


tgcagaacat 


ccaggggcaa 


atggtacatc 


aggccatatc 


acctagaact 


540 


H 


ttaaatgcat 


gggtaaaagt 


agtagaagag 


aaggctttca 


gcccagaagt 


aatacccatg 


600 




ttttcagcat 


tatcagaagg 


agccacccca 


caagatttaa 


acaccatgct 


aaacacagtg 


660 




gggggacatc 


aagcagccat 


gcaaatgtta 


aaagagacca 


tcaatgagga 


agctgcagaa 


720 




tgggatagag 


tacatccagt 


gcatgcaggg 


cctattgcac 


caggccagat 


gagagaacca 


780 




aggggaagtg 


acatagcagg 


aactactagt 


acccttcagg 


aacaaatagg 


atggatgaca 


840 




aataatccac 


ctatcccagt 


aggagaaatt 


tataaaagat 


ggataatcct 


gggattaaat 


900 




aaaatagtaa 


gaatgtatag 


ccctaccagc 


attctggaca 


taagacaagg 


accaaaagaa 


960 




ccttttagag 


actatgtaga 


ccggttctat 


aaaactctaa gagccgagca agcttcacag 


1020 




gaggtaaaaa 


attggatgac 


agaaaccttg 


ttggtccaaa 


atgcgaaccc 


agattgtaag 


1080 




actattttaa 


aagcattggg 


accagcggct 


acactagaag 


aaatgatgac 


agcatgtcag 


1140 




ggagtaggag 


gacccggcca 


taaggcaaga 


gttttgttat 


tcataatgat 


agtaggaggc 


1200 



ttggtaggtt taagaatagt ttttgctgta ctttctgtag tgaatagagt taggcaggga 1260 
tattcaccat tatcgtttca gacccacctc ccaatcccga ggggataa 1308 

<210> 37 
<211> 363 
<212> PRT 

<213> Human immunodeficiency virus type 1 
<400> 37 

Met Gly Ala Arg Ala Ser Val Leu Ser Gly Gly Glu Leu Asp Arg Trp 
15 10 15 

Glu Lys He Arg Leu Arg Pro Gly Gly Lys Lys Lys Tyr Lys Leu Lys 
20 25 30 

His He Val Trp Ala Ser Arg Glu Leu Glu Arg Phe Ala Val Asn Pro 
35 ^ 40 45 

O Gly Leu Leu Glu Thr Ser Glu Gly Cys Arg Gin He Leu Gly Gin Leu 
n 50 55 60 

W Gin Pro Ser Leu Gin Thr Gly Ser Glu Glu Leu Arg Ser Leu Tyr Asn 

U 65 70 75 80 

yi Thr Val Ala Thr Leu Tyr Cys Val His Gin Arg He Glu He Lys Asp 

^ 85 90 95 

O Thr Lys Glu Ala Leu Asp Lys He Glu Glu Glu Gin Asn Lys Ser Lys 

P 100 ~ 105 HO 

M> Lvs Lys Ala Gin Gin Ala Ala Ala Asp Thr Gly His Ser Ser Gin Val 

115 120 125 

Ser Gin Asn Tyr Pro He Val Gin Asn He Gin Gly Gin Met Val His 
13 0 135 140 

Gin Ala He Ser Pro Arg Thr Leu Asn Ala Trp Val Lys Val Val Glu 
145 150 155 160 

Glu Lys Ala Phe Ser Pro Glu Val He Pro Met Phe Ser Ala Leu Ser 
165 170 175 

Glu Gly Ala Thr Pro Gin Asp Leu Asn Thr Met Leu Asn Thr Val Gly 
180 185 190 



Gly His Gin Ala Ala Met Gin Met Leu Lys Glu Thr He Asn Glu Glu 



195 



200 



205 



Ala Ala Glu Trp Asp Arg Val His Pro Val His Ala Gly Pro He Ala 
210 ' 215 22 0 

Pro Gly Gin Met Arg Glu Pro Arg Gly Ser Asp He Ala Gly Thr Thr 
225 230 235 240 

Ser Thr Leu Gin Glu Gin He Gly Trp Met Thr Asn Asn Pro Pro He 
245 250 255 



Pro Val Gly Glu He Tyr Lys Arg Trp He He Leu Gly Leu Asn Lys 
260 265 270 

He Val Arg Met Tyr Ser Pro Thr Ser He Leu Asp He Arg Gin Gly 
275 280 285 

Pro Lys Glu Pro Phe Arg Asp Tyr Val Asp Arg Phe Tyr Lys Thr Leu 
290 295 300 

Arq Ala Glu Gin Ala Ser Gin Glu Val Lys Asn Trp Met Thr Glu Thr 
305 310 315 320 

Leu Leu Val Gin Asn Ala Asn Pro Asp Cys Lys Thr He Leu Lys Ala 
325 330 335 

Leu Gly Pro Ala Ala Thr Leu Glu Glu Met Met Thr Ala Cys Gin Gly 
340 345 350 



Val Gly Gly Pro Gly His Lys Ala Arg Val Leu 
355 360 



<210> 38 
<211> 410 
<212> PRT 

<213> Human immunodeficiency virus type 1 
<400> 38 

Met Arg Val Lys Glu Lys Tyr Gin His Leu Trp Arg Trp Gly Trp Arg 
1 " 5 10 15 

Trp Gly Thr Met Leu Leu Gly Met Leu Met He Cys Ser Ala Gly Ala 
20 25 30 



Arg Ala Ser Val Leu Ser Gly Gly Glu Leu Asp Arg Trp Glu Lys He 
35 40 45 



Arg Leu Arg Pro Gly Gly Lys Lys Lys Tyr Lys Leu Lys His He Val 
50 J 55 60 



Trp Ala Ser Arg Glu Leu Glu Arg Phe Ala Val Asn Pro Gly Leu Leu 
65 ^ 70 75 80 

Glu Thr Ser Glu Gly Cys Arg Gin He Leu Gly Gin Leu Gin Pro Ser 
85 90 95 

Leu Gin Thr Gly Ser Glu Glu Leu Arg Ser Leu Tyr Asn Thr Val Ala 
100 105 HO 

Thr Leu Tyr Cys Val His Gin Arg He Glu He Lys Asp Thr Lys Glu 
115 120 125 

Ala Leu Asp Lys He Glu Glu Glu Gin Asn Lys Ser Lys Lys Lys Ala 
130 ~ 135 140 

Gin Gin Ala Ala Ala Asp Thr Gly His Ser Ser Gin Val Ser Gin Asn 
145 150 155 160 

Tyr Pro Gin Gin Ala Ala Ala Asp Thr Gly His Ser Ser Gin Val Ser 
165 170 175 

Gin Asn Tyr Pro Gin Gin Ala Ala Ala Asp Thr Gly His Ser Ser Gin 
180 185 190 

Val Ser Gin Asn Tyr Pro He Val Gin Asn He Gin Gly Gin Met Val 
195 200 205 

His Gin Ala He Ser Pro Arg Thr Leu Asn Ala Trp Val Lys Val Val 
210 215 220 

Glu Glu Lys Ala Phe Ser Pro Glu Val He Pro Met Phe Ser Ala Leu 
225 230 235 240 

Ser Glu Gly Ala Thr Pro Gin Asp Leu Asn Thr Met Leu Asn Thr Val 
245 250 255 

Gly Gly His Gin Ala Ala Met Gin Met Leu Lys Glu Thr He Asn Glu 
260 265 270 

Glu Ala Ala Glu Trp Asp Arg Val His Pro Val His Ala Gly Pro He 
275 280 285 



Ala Pro Gly Gin Met Arg Glu Pro Arg Gly Ser Asp lie Ala Gly Thr 
290 295 300 



Thr Ser Thr Leu Gin Glu Gin lie Gly Trp Met Thr Asn Asn Pro Pro 
305 310 315 320 



lie Pro Val Gly Glu lie Tyr Lys Arg Trp lie lie Leu Gly Leu Asn 
325 330 335 



Lys lie Val Arg Met Tyr Ser Pro Thr Ser lie Leu Asp lie Arg Gin 
340 345 350 



Gly 'Pro Lys Glu Pro Phe Arg Asp Tyr Val Asp Arg Phe Tyr Lys Thr 
355 360 365 



Leu Arg Ala Glu Gin Ala Ser Gin Glu Val Thr lie Leu Lys Ala Leu 
370 375 380 



Gly Pro Ala Ala Thr Leu Glu Glu Met Met Thr Ala Cys Gin Gly Val 
385 390 395 400 



Gly Gly Pro Gly His Lys Ala Arg Val Leu 
405 ~ 410 



<210> 39 
<211> 453 
<212> PRT 

<213> Human immunodeficiency virus type 1 
<400> 39 

Met Arg Val Lys Glu Lys Tyr Gin His Leu Trp Arg Trp Gly Trp Arg 
15 10 15 



Trp Gly Thr Met Leu Leu Gly Met Leu Met lie Cys Ser Ala Gly Ala 
20 25 30 



Arg Ala Ser Val Leu Ser Gly Gly Glu Leu Asp Arg Trp Glu Lys lie 
35 40 45 



Arg Leu Arg Pro Gly Gly Leu Ser Gly Gly Glu Leu Asp Arg Trp Glu 
50 55 60 



Lys lie Arg Leu Arg Pro Gly Gly Lys Lys Lys Tyr Lys Leu Lys His 
65 70 75 80 



lie Val Trp Ala Ser Arg Glu Leu Glu Arg Phe Ala Val Asn Pro Gly 
85 90 95 



Leu Leu Glu Thr Ser Glu Gly Cys Arg Gin He Leu Gly Gin Leu Gin 
100 105 HO 



Pro Ser Leu Gin Thr Gly Ser Glu Glu Leu Arg Ser Leu Tyr Asn Thr 
115 120 125 

Val Ala Thr Leu Tyr Cys Val His Gin Arg He Glu He Lys Asp Thr 

130 " 135 140 



Lys Glu Ala Leu Asp Lys He Glu Glu Glu Gin Asn Lys Ser Lys Lys 
145 150 155 160 

Lys Ala Gin Gin Ala Ala Ala Asp Thr Gly His Ser Ser Gin Val Ser 
165 170 175 



Gin Asn Tyr Pro He Val Gin Asn He Gin Gly Gin Met Val His Gin 
180 185 190 

Ala He Ser Pro Arg Thr Leu Asn Ala Trp Val Lys Val Val Glu Glu 
195 ~ 200 205 

Lys Ala Phe Ser Pro Glu Val He Pro Met Phe Ser Ala Leu Ser Glu 
210 215 220 

Gly Ala Thr Pro Gin Asp Leu Asn Thr Met Leu Asn Thr Val Gly Gly 
225 230 235 240 

His Gin Ala Ala Met Gin Met Leu Lys Glu Thr He Asn Glu Glu Ala 
245 250 255 



Ala Glu Trp Asp Arg Val His Pro Val His Ala Gly Pro He Ala Pro 
260 " 265 270 

Gly Gin Met Arg Glu Pro Arg Gly Ser Asp He Ala Gly Thr Thr Ser 
275 280 285 

Thr Leu Gin Glu Gin He Gly Trp Met Thr Asn Asn Pro Pro He Pro 
290 295 300 

Val Gly Glu He Tyr Lys Arg Trp He He Leu Gly Leu Asn Lys He 
305 310 ~ 315 320 



Val Arq Met Tyr Ser Pro Thr Ser He Leu Asp He Arg Gin Gly Pro 
325 330 335 



Lys Glu Pro Phe Arg Asp Tyr Val Asp Arg Phe Tyr Lys Thr Leu Arg 
340 345 350 



Ala Glu Gin Ala Ser Gin Glu Val Lys Asn Trp Met Thr Glu Thr Leu 
355 360 365 

Leu Val Gin Asn Ala Asn Pro Asp Cys Lys Thr He Leu Lys Ala Leu 
370 375 380 

Gly Pro Ala Ala Thr Leu Glu Glu Met Met Thr Ala Cys Gin Gly Val 
385 390 395 400 

Gly Gly Pro Gly His Lys Ala Arg Val Leu Leu Phe He Met He Val 
405 410 415 

Gly Gly Leu Val Gly Leu Arg He Val Phe Ala Val Leu Ser Val Val 
420 425 430 





Asn Arg Val Arg Gin Gly Tyr Ser Pro Leu Ser Phe Gin Thr His Leu 




S 5 


435 440 


445 




Ul 


Pro He Pro Arg Gly 






til 


450 








<210> 40 








<211> 399 








<212> DNA 








<213> Human immunodeficiency virus type 1 






<400> 40 


ggagaattag atcgatggga aaaaattcgg 


60 




atgggtgcga gagcgtcagt attaagcggg 




ttaaggccag ggggaaagaa aaaatataaa 


ttaaaacata tagtatgggc aagcagggag 


120 




ctagaacgat tcgcagttaa tcctggcctg 


ttagaaacat cagaaggctg tagacaaata 


180 




ctgggacagc tacaaccatc ccttcagaca 


ggatcagaag aacttagatc attatataat 


240 




acagtagcaa ccctctattg tgtgcatcaa 


aggatagaga taaaagacac caaggaagct 


300 




ttagacaaga tagaggaaga gcaaaacaaa 


agtaagaaaa aagcacagca agcagcagct 


360 




gacacaggac acagcagtca ggtcagccaa 


aattactaa 


399 



<210> 41 
<211> 486 
<212> DNA 

<213> Human immunodeficiency virus type 1 

atgagagtga aggagaaata tcagcacttg tggagatggg ggtggagatg gggcaccatg 60 



ctccttggga tgttgatgat ctgtagtgct ggtgcgagag cgtcagtatt aagcggggga 
gaattagatc gatgggaaaa aattcggtta aggccagggg gaaagaaaaa atataaatta 
aaacatatag tatgggcaag cagggagcta gaacgattcg cagttaatcc tggcctgtta 
gaaacatcag aaggctgtag acaaatactg ggacagctac aaccatccct tcagacagga 
tcagaagaac ttagatcatt atataataca gtagcaaccc tctattgtgt gcatcaaagg 
atagagataa aagacaccaa ggaagcttta gacaagatag aggaagagca aaacaaaagt 
aagaaaaaag cacagcaagc agcagctgac acaggacaca gcagtcaggt cagccaaaat 
tactaa 

<210> 42 
<211> 615 
<212> DNA 

<213> Human immunodeficiency virus type 1 
<400> 42 

atgagagtga aggagaaata tcagcacttg tggagatggg ggtggagatg gggcaccatg 
ctccttggga tgttgatgat ctgtagtgct ggtgcgagag cgtcagtatt aagcggggga 
gaattagatc gatgggaaaa aattcggtta aggccagggg gaaagaaaaa atataaatta 
aaacatatag tatgggcaag cagggagcta gaacgattcg cagttaatcc tggcctgtta 
gaaacatcag aaggctgtag acaaatactg ggacagctac aaccatccct tcagacagga 
tcagaagaac ttagatcatt atataataca gtagcaaccc tctattgtgt gcatcaaagg 
atagagataa aagacaccaa ggaagcttta gacaagatag aggaagagca aaacaaaagt 
aagaaaaaag cacagcaagc agcagctgac acaggacaca gcagtcaggt cagccaaaat 
tacttattca taatgatagt aggaggcttg gtaggtttaa gaatagtttt tgctgtactt 
tctgtagtga atagagttag gcagggatat tcaccattat cgtttcagac ccacctccca 
atcccgaggg gataa 

<210> 43 
<211> 132 
<212> PRT 

<213> Human immunodeficiency virus type 1 
<400> 43 

Met Gly Ala Arg Ala Ser Val Leu Ser Gly Gly Glu Leu Asp Arg Trp 
15 10 15 

Glu Lys He Arg Leu Arg Pro Gly Gly Lys Lys Lys Tyr Lys Leu Lys 



His lie Val Trp Ala Ser Arg Glu 
35 40 



Gly Leu Leu Glu Thr Ser Glu Gly 
50 55 



Gin Pro Ser Leu Gin Thr Gly Ser 
65 70 



Thr Val Ala Thr Leu Tyr Cys Val 
85 



Thr Lys Glu Ala Leu Asp Lys lie 
100 



Lys Lys Ala Gin Gin Ala Ala Ala 
115 120 



Ser Gin Asn Tyr 
130 



<210> 
<211> 
<212> 
<213> 

<400> 

Met Arg 
1 



Trp Gly 



Arg Ala 



Arg Leu Arg Pro Gly Gly Lys Lys 
50 55 



Trp Ala Ser Arg Glu Leu Glu Arg 
65 70 



Glu Thr Ser Glu Gly Cys Arg Gin 
85 



Leu Glu Arg Phe Ala Val Asn Pro 
45 



Cys Arg Gin lie Leu Gly Gin Leu 
60 



Glu Glu Leu Arg Ser Leu Tyr Asn 
75 ~ 80 



His Gin Arg lie Glu lie Lys Asp 
90 " 95 



Glu Glu Glu Gin Asn Lys Ser Lys 
105 110 



Asp Thr Gly His Ser Ser Gin Val 
125 



Lys Tyr Lys Leu Lys His lie Val 
60 



Phe Ala Val Asn Pro Gly Leu Leu 
75 80 



lie Leu Gly Gin Leu Gin Pro Ser 
90 95 



44 

179 

PRT 

Human immunodeficiency virus type 1 



44 



Val Lys Glu Lys Tyr Gin His Leu Trp Arg Trp Gly Trp Arg 
5 ~ 10 15 



Thr Met Leu Leu Gly Met Leu Met lie Cys Ser Ala Gly Ala 
20 25 30 



Ser Val Leu Ser Gly Gly Glu Leu Asp Arg Trp Glu Lys lie 
35 40 45 



Leu Gin Thr Gly Ser Glu Glu Leu Arg Ser Leu Tyr Gly Gin Leu Gin 



100 



105 



110 



Pro Ser Leu Gin Thr Gly Ser Glu Glu Leu Arg Ser Leu Tyr Asn Thr 
115 120 125 



Val Ala Thr Leu Tyr Cys Val His Gin Arg lie Glu lie Lys Asp Thr 
130 " 135 140 



Lys Glu Ala Leu Asp Lys lie Glu Glu Glu Gin Asn Lys Ser Lys Lys 
145 150 155 160 



Lys Ala Gin Gin Ala Ala Ala Asp Thr Gly His Ser Ser Gin Val Ser 
165 170 175 



Gin Asn Tyr 



<210> 45 
<211> 186 
<212> PRT 

<213> Human immunodeficiency virus type 1 
<400> 45 

Met Arg Val Lys Glu Lys Tyr Gin His Leu Trp Arg Trp Gly Trp Arg 
15 10 15 



Trp Gly Thr Met Leu Leu Gly Met Leu Met lie Cys Ser Ala Gly Ala 
20 25 30 



Arg Ala Ser Val Leu Ser Gly Gly Glu Leu Asp Arg Trp Glu Lys lie 
35 40 45 



Arg Leu Arg Pro Gly Gly Lys Lys Lys Tyr Lys Leu Lys His lie Val 
50 55 60 



Trp Ala Ser Arg Glu Leu Glu Arg Gly Gin Leu Gin Pro Ser Leu Gin 
65 70 75 80 



Thr Gly Ser Glu Glu Leu Arg Ser Leu Tyr Asn Thr Val Ala Thr Leu 
85 90 95 



Tyr Cys Val His Gin Arg lie Glu lie Lys Asp Thr Lys Glu Ala Leu 
100 105 110 



Asp Lys lie Glu Glu Glu Gin Asn Lys Ser Lys Lys Lys Ala Gin Gin 
115 120 125 



Ala Ala Ala Asp Thr Gly His Ser Ser Gin Val Ser Gin Asn Tyr Leu 
130 135 140 



Phe He Met He Val Gly Gly Leu Val Gly Leu Arg He Val Phe Ala 
145 150 155 160 



Val Leu Ser Val Val Asn Arg Val Arg Gin Gly Tyr Ser Pro Leu Ser 
165 170 175 



Phe Gin Thr His Leu. Pro He Pro Arg Gly 
180 185 



<210> 46 
<211> 699 
<212> DNA 

<213> Human immunodeficiency virus type 1 








<400> 46 
atgcctatag 


tgcagaacat 


ccaggggcaa 


atggtacatc 


aggccatatc 


acctagaact 


60 


ttaaatgcat 


gggtaaaagt 


agtagaagag 


aaggctttca 


gcccagaagt 


aatacccatg 


120 


ttttcagcat 


tatcagaagg 


agccacccca 


caagatttaa 


acaccatgct 


aaacacagtg 


180 


gggggacatc 


aagcagccat 


gcaaatgtta 


aaagagacca 


tcaatgagga 


agctgcagaa 


240 


tgggatagag 


tacatccagt 


gcatgcaggg 


cctattgcac 


caggccagat 


gagagaacca 


300 


aggggaagtg 


acatagcagg 


aactactagt 


acccttcagg 


aacaaatagg 


atggatgaca 


360 


aataatccac 


ctatcccagt 


aggagaaatt 


tataaaagat ggataatcct 


gggattaaat 


420 


aaaatagtaa 


gaatgtatag 


ccctaccagc 


attctggaca 


taagacaagg 


accaaaagaa 


480 


ccttttagag 


actatgtaga 


ccggttctat 


aaaactctaa 


gagccgagca 


agcttcacag 


540 


gaggtaaaaa 


attggatgac 


agaaaccttg 


ttggtccaaa 


atgcgaaccc 


agattgtaag 


600 


actattttaa 


aagcattggg 


accagcggct 


acactagaag 


aaatgatgac 


agcatgtcag 


660 


ggagtaggag 


gacccggcca 


taaggcaaga 


gttttgtaa 






699 



<210> 47 
<211> 786 
<212> DNA 

<213> Human immunodeficiency virus type 1 
<400> 47 

atgagagtga aggagaaata tcagcacttg tggagatggg ggtggagatg gggcaccatg 60 
ctccttggga tgttgatgat ctgtagtgct cctatagtgc agaacatcca ggggcaaatg 12 0 
gtacatcagg ccatatcacc tagaacttta aatgcatggg taaaagtagt agaagagaag 180 
gctttcagcc cagaagtaat acccatgttt tcagcattat cagaaggagc caccccacaa 240 





gatttaaaca 


ccatgctaaa 


cacagtgggg ggacatcaag 


cagccatgca 


aatgttaaaa 


300 




gagaccatca 


atgaggaagc 


tgcagaatgg gatagagtac 


atccagtgca 


tgcagggcct 


360 




attgcaccag 


gccagatgag 


agaaccaagg 


ggaagtgaca 


tagcaggaac 


tactagtacc 


420 




cttcaggaac 


aaataggatg gatgacaaat 


aatccaccta 


tcccagtagg agaaatttat 


480 




aaaagatgga 


taatcctggg 


attaaataaa 


atagtaagaa 


tgtatagccc 


taccagcatt 


540 




ctggacataa 


gacaaggacc 


aaaagaacct 


tttagagact 


atgtagaccg gttctataaa 


600 




actctaagag 


ccgagcaagc 


ttcacaggag gtaaaaaatt 


ggatgacaga 


aaccttgttg 


r tz r\ 
660 




gtccaaaatg 


cgaacccaga 


ttgtaagact 


attttaaaag 


cattgggacc 


agcggctaca 


720 




ctagaagaaa 


tgatgacagc 


atgtcaggga gtaggaggac 


ccggccataa 


ggcaagagtt 


780 




ttgtaa 












786 


, r ...„. 

o 

pa 


<210> 48 
<211> 915 
<212> DNA 

<213> Human immunodeficiency virus type 1 










<400> 48 
atgagagtga 


aggagaaata 


tcagcacttg 


tggagatggg ggtggagatg gggcaccatg 


60 


w 

In 


ctccttggga 


tgttgatgat 


ctgtagtgct 


cctatagtgc 


agaacatcca 


ggggcaaatg 


120 




gtacatcagg 


ccatatcacc 


tagaacttta 


aatgcatggg 


taaaagtagt 


agaagagaag 


180 




gctttcagcc 


cagaagtaat 


acccatgttt 


tcagcattat 


cagaaggagc 


caccccacaa 


240 


: -— • 


gatttaaaca 


ccatgctaaa 


cacagtgggg 


ggacatcaag 


cagccatgca aatgttaaaa 


300 




gagaccatca 


atgaggaagc 


tgcagaatgg 


gatagagtac 


atccagtgca tgcagggcct 


360 




attgcaccag 


gccagatgag 


agaaccaagg 


ggaagtgaca 


tagcaggaac 


tactagtacc 


420 




cttcaggaac 


aaataggatg 


gatgacaaat 


aatccaccta 


tcccagtagg 


agaaatttat 


480 




aaaagatgga 


taatcctggg 


attaaataaa 


atagtaagaa 


tgtatagccc 


taccagcatt 


540 




ctggacataa 


gacaaggacc 


aaaagaacct 


tttagagact 


atgtagaccg 


gttctataaa 


600 




actctaagag 


ccgagcaagc 


ttcacaggag 


gtaaaaaatt 


ggatgacaga 


aaccttgttg 


660 




gtccaaaatg 


cgaacccaga 


ttgtaagact 


attttaaaag cattgggacc 


agcggctaca 


720 




ctagaagaaa 


tgatgacagc 


atgtcaggga 


gtaggaggac 


ccggccataa ggcaagagtt 


780 




ttgttattca 


taatgatagt 


aggaggcttg 


gtaggtttaa 


gaatagtttt 


tgctgtactt 


840 




tctgtagtga 


atagagttag 


gcagggatat 


tcaccattat 


cgtttcagac 


ccacctccca 


900 




atcccgaggg 


gataa 










915 



<210> 49 
<211> 232 
<212> PRT 

<213> Human immunodeficiency virus type 1 
<400> 49 

Met Pro He Val Gin Asn He Gin Gly Gin Met Val His Gin Ala lie 
15 10 15 



Ser Pro Arg Thr Leu Asn Ala Trp Val Lys Val Val Glu Glu Lys Ala 
20 25 30 



Phe Ser Pro Glu Val He Pro Met Phe Ser Ala Leu Ser Glu Gly Ala 
35 40 45 

Thr Pro Gin Asp Leu Asn Thr Met Leu Asn Thr Val Gly Gly His Gin 
50 * 55 60 

M Ala Ala Met Gin Met Leu Lys Glu Thr He Asn Glu Glu Ala Ala Glu 

fl 65 70 75 80 

.«(. 

H Trp Asp Arg Val His Pro Val His Ala Gly Pro He Ala Pro Gly Gin 

yy 85 90 95 

In Met Arg Glu Pro Arg Gly Ser Asp He Ala Gly Thr Thr Ser Thr Leu 

10 0 105 110 

Gin Glu Gin He Gly Trp Met Thr Asn Asn Pro Pro He Pro Val Gly 
p 115 120 125 



Glu He Tyr Lys Arg Trp He He Leu Gly Leu Asn Lys He Val Arg 
130 135 140 



Met Tyr Ser Pro Thr Ser He Leu Asp He Arg Gin Gly Pro Lys Glu 
145 150 155 160 



Pro Phe Arg Asp Tyr Val Asp Arg Phe Tyr Lys Thr Leu Arg Ala Glu 
165 170 175 



Gin Ala Ser Gin Glu Val Lys Asn Trp Met Thr Glu Thr Leu Leu Val 
180 185 190 



Gin Asn Ala Asn Pro Asp Cys Lys Thr He Leu Lys Ala Leu Gly Pro 
195 200 205 



Ala Ala Thr Leu Glu Glu Met Met Thr Ala Cys Gin Gly Val Gly Gly 
210 215 220 



Pro Gly His Lys Ala Arg Val Leu 
225 230 



<210> 50 

<211> 261 

<212> PRT 

<213> Human immunodeficiency virus type 1 

<400> 50 

Met Arg Val Lys Glu Lys Tyr Gin His Leu Trp Arg Trp Gly Trp Arg 
1 5 10 *' 15 



Trp Gly Thr Met Leu Leu Gly Met Leu Met lie Cys Ser Ala Pro lie 
20 ~ 25 30 



Val Gin Asn He Gin Gly Gin Met Val His Gin Ala He Ser Pro Arg 
35 40 45 



Thr Leu Asn Ala Trp Val Lys Val Val Glu Glu Lys Ala Phe Ser Pro 
50 55 60 



Glu Val He Pro Met Phe Ser Ala Leu Ser Glu Gly Ala Thr Pro Gin 
65 70 75 80 



Asp Leu Asn Thr Met Leu Asn Thr Val Gly Gly His Gin Ala Ala Met 
85 90 95 



Gin Met Leu Lys Glu Thr He Asn Glu Glu Ala Ala Glu Trp Asp Arg 
100 105 110 



Val His Pro Val His Ala Gly Pro He Ala Pro Gly Gin Met Arg Glu 
115 120 125 



Pro Arg Gly Ser Asp He Ala Gly Thr Thr Ser Thr Leu Gin Glu Gin 
130 ~ 135 " 140 



He Gly Trp Met Thr Asn Asn Pro Pro He Pro Val Gly Glu He Tyr 
145 ~ 150 155 160 



Lys Arg Trp He He Leu Gly Leu Asn Lys He Val Arg Met Tyr Ser 
165 170 175 



Pro Thr Ser lie Leu Asp He Arg Gin Gly Pro Lys Glu Pro Phe Arg 
180 185 190 



Asp Tyr Val Asp Arg Phe Tyr Lys Thr Leu Arg Ala Glu Gin Ala Ser 
195 200 205 



Gin Glu Val Lys Asn Trp Met Thr Glu Thr Leu Leu Val Gin Asn Ala 
210 ~ 215 220 



Asn Pro Asp Cys Lys Thr lie Leu Lys Ala Leu Gly Pro Ala Ala Thr 
225 230 235 240 



Leu Glu Glu Met Met Thr Ala Cys Gin Gly Val Gly Gly Pro Gly His 
245 250 255 



Lys Ala Arg Val Leu 
260 



<210> 51 
<211> 286 
<212> PRT 

<213> Human immunodeficiency virus type 1 
<400> 51 

Met Arg Val Lys Glu Lys Tyr Gin His Leu Trp Arg Trp Gly Trp Arg 
15 10 15 



Trp Gly Thr Met Leu Leu Gly Met Leu Met lie Cys Ser Ala Pro lie 
20 25 * 30 



Val Gin Asn He Gin Gly Gin Met Val His Gin Ala He Ser Pro Arg 
35 40 45 



Thr Leu Asn Ala Trp Val Lys Val Val Glu Glu Lys Ala Phe Ser Pro 
50 55 60 



Glu Val He Pro Met Phe Ser Ala Leu Ser Glu Gly Ala Thr Pro Gin 
65 70 75 80 



Asp Leu Asn Thr Met Leu Asn Thr Val Gly Gly His Gin Ala Ala Met 
85 90 95 



Gin Met Leu Lys Glu Thr He Asn Glu Glu Ala Ala Arg Glu Pro Arg 
100 105 110 



Gly Ser Asp He Ala Gly Thr Thr Ser Thr Leu Gin Glu Gin He Gly 
115 12 0 125 



Trp Met Thr Asn Asn Pro Pro He Pro Val Gly Glu He Tyr Lys Arg 
130 135 140 



Trp He lie Leu Gly Leu Asn Lys lie Val Arg Met Tyr Ser Pro Thr 
145 150 155 160 



Ser lie Leu Asp He Arg Gin Gly Pro Lys Glu Pro Phe Arg Asp Tyr 
165 ~ ~ 170 175 

Val Asp Arg Phe Tyr Lys Thr Leu Arg Ala Glu Gin Ala Ser Gin Glu 
180 185 190 

Val Lys Asn Trp Met Thr Glu Thr Leu Leu Val Gin Asn Ala Asn Pro 
195 200 2 05 

Asp Cys Lys Thr He Leu Lys Ala Leu Gly Pro Ala Ala Thr Leu Glu 
210 215 220 

Glu Met Met Thr Ala Cys Gin Gly Val Gly Gly Pro Gly His Lys Ala 
U 225 230 235 240 

i i 

Zl Arg Val Leu Leu Phe He Met He Val Gly Gly Leu Val Gly Leu Arg 

245 250 255 

yj He Val Phe Ala Val Leu Ser Val Val Asn Arg Val Arg Gin Gly Tyr 

111 260 265 270 

s 

!** Ser Pro Leu Ser Phe Gin Thr His Leu Pro He Pro Arg Gly 

275 280 285 

r% 

f«s <210> 52 

rf <211> 3839 

? ™ <212> DNA 

<213> Artificial sequence 

<220> 

<223> Modified Env/Tat 
<400> 52 

gaattctgca acaactgctg tttatccatt ttcagaattg ggtgtcgaca tagcagaata 60 

ggcgttactc gacagaggag agcaagaaat ggagccagta gatcctagac tagagccctg 12 0 

gaagcatcca ggaagtcagc ctaaaactgc ttgtaccaat tgctattgta aaaagtgttg 180 

ctttcattgc caagtttgtt tcataacaaa agccttaggc atctcctatg gcaggaagaa 240 

gcggagacag cgacgaagac ctcctcaagg cagtcagact catcaagttt ctctatcaaa 300 

gcagtaagta gtacatgtaa tgcaacctat acaaatagca atagtagcat tagtagtagc 360 

aataataata gcaatagttg tgtggtccat agtaatcata gaatatagga aaatattaag 420 



acaaagaaaa 


atagacaggt 


taattgatag 


actaatagaa 


agagcagaag 


acagtggcaa 


480 


tgagagtgaa 


ggagaaatat 


cagcacttgt 


ggagatgggg 


gtggagatgg 


ggcaccatgc 


540 


tccttgggat 


gttgatgatc 


tgtagtgcta 


cagaaaaatt 


gtgggtcaca 


gtctattatg 


600 


gggtacctgt 


gtggaaggaa 


gcaaccacca 


ctctattttg 


tgcatcagat 


gctaaagcat 


660 


atgatacaga 


ggtacataat 


gtttgggcca 


cacatgcctg 


tgtacccaca 


gaccccaacc 


720 


cacaagaagt 


agtattggta 


aatgtgacag 


aaaattttaa 


catgtggaaa 


aatgacatgg 


780 


tagaacagat 


gcatgaggat 


ataatcagtt 


tatgggatca 


aagcctaaag 


ccatgtgtaa 


840 


aattaacccc 


actctgtgtt 


ggagctggta 


gttgtaacac 


ctcagtcatt 


acacaggcct 


900 


gtccaaaggt 


atcctttgag 


ccaattccca 


tacattattg 


tgccccggct 


ggttttgcga 


960 


ttctaaaatg 


taataataag 


acgttcaatg 


gaacaggacc 


atgtacaaat 


gtcagcacag 


1020 


tacaatgtac 


acatggaatt 


aggccagtag 


tatcaactca 


actgctgtta 


aatggcagtc 


1080 


tggcagaaga 


agaggtagta 


attagatctg 


ccaatttcac 


agacaatgct 


aaaaccataa 


1140 


tagtacagct 


gaaccaatct 


gtagaaatta 


attgtacaag 


acccaacaac 


aatacaagaa 


1200 


aaagtatccg 


tatccagaga 

Z? ZJ 


ggaccaggga 


gagcatttgt 


tacaatagga 


aaaataggaa 


1260 


atatgagaca 


agcacattgt 


ctcgggtgta 


ccagacctaa 


caacaataca 


agaaaaagtg 


1320 


tacgtatagg 


accaggacaa 


acattctatg 


caacaggtga 


tataataggg 


gatataagac 


1380 


aagcacattg 


ttgtacgaga 


cccaacaata 


atacaagaaa 


aagtataagg 


ataggaccag 


1440 


gacaagcatt 


ctatgcaaca 


ggagaaataa 


taggagatat 


aagacaagca 


cattgttgca 


1500 


caaggcccta 


caacaatata 


agacaaagga 


cccccatagg 


actagggcaa 


gcactctata 


1560 


caacaagaag 


aatagaagat 


ataagaagag 


cacattgttg 


taccagaccc 


tccaccaata 


1620 


caagaacaag 


tatacgtata 


ggaccaggac 


aagtattcta 


tagaacagga 


gacataacag 

Z) ZJ 


1680 


gagatataag 


aaaagcatat 


tgtggatcct 


gtacaagacc 


caacaacaat 


acaagaaaaa 


1740 


gaatatcttt 


aggaccagga 


cgagtatttt 


atacagcagg 


agaaataata 


ggagacatca 


1800 


gaaaggcaca 


ttgttgtacc 


agacctaata 


acaatacaag 


aaaaagtata 


acttttgcac 


1860 


caggacaagc 


gctctatgca 


acaggtgaaa 


taataggaga 


tataagacaa 


gcacattgtc 


1920 


tccrcratcrtac 


cagacctaac 


aacaatacaa 


gaaaaagtgt 


acgtatagga 


ccaggacaaa 


1980 


cattctatgc 


aacaggtgat 


ataatagggg 


atataagaca 


agcacattgt 


tgtacgagac 


2040 


ccaacaataa 


tacaagaaaa 


agtataagga 


taggaccagg 


acaagcattc 


tatgcaacag 


2100 


gagaaataat 


aggagatata 


agacaagcac 


attgttgcac 


aaggccctac 


aacaatataa 


2160 


gacaaaggac 


ccccatagga 


ctagggcaag 


cactctatac 


aacaagaaga 


atagaagata 


2220 


taagaagagc 


acattgttgt 


accagaccct 


ccaccaatac 


aagaacaagt 


atacgtatag 


2280 



gaccaggaca 


agtattctat 


agaacaggag 


acataacagg 


agatataaga 


aaagcatatt 


2340 


gtggatcctg 


tacaagaccc 


aacaacaata 


caagaaaaag 


aatatcttta 


ggaccaggac 


2400 


gagtatttta 


tacagcagga 


gaaataatag 


gagacatcag 


aaaggcacat 


tgttgtacca 


2460 


gacctaataa 


caatacaaga 


aaaagtataa 


cttttgcacc 


aggacaagcg 


ctctatgcaa 


2520 


caggtgaaat 


aataggagat 


ataagacaag 


cacattgtct 


cgggaacatt 


agtagagcaa 


2580 


aatggaataa 


cactttaaaa 


cagatagata 


gcaaattaag 


agaacaattt 


ggaaataata 


2640 


aaacaataat 


ctttaagcag 


tcctcaggag 


gggacccaga 


aattgtaacg 


cacagtttta 


2700 


attgtggagg 


ggaatttttc 


tactgtaatt 


caacacaact 


gtttaatagt 


acttggttta 


2760 


atagtacttg 


gagtactaaa 


gggtcaaata 


acactgaagg 


aagtgacaca 


atcaccctcc 


2820 


catgcagaat 


aaaacaaatt 


ataaacatgt 


ggcaggaagt 


aggaaaagca 


atgtatgccc 


2880 


ctcccatcag 


tggacaaatt 


agatgttcat 


caaatattac 


agggctgcta 


ttaacaagag 


2940 


atggtggtaa 


tagcaacaat 


gagtccgaga 


tcttcagacc 


tggaggagga 


gatatgaggg 


3000 


acaattggag 


aagtgaatta 


tataaatata 


aagtagtaaa 


aattgaacca 


ttaggagtag 


3060 


cacccaccaa 


ggcaaagaga 


agagtggtgc 


agactagtgc 


agtgggaata 


ggagctttgt 


3120 


tccttgggtt 


cttgggagca 


gcaggaagca 


ctatgggcgc 


agcgtcaatg 


acgctgacgg 


3180 


tacaggccag 


acaattattg 


tctggtatag 


tgcagcagca 


gaacaatttg 


ctgagggcta 


3240 


ttgaggcgca 


acagcatctg 


ttgcaactca 


cagtctgggg 


catcaagcag 


ctccaggcaa 


3300 


gaatcctggc 


tgtggaaaga 


tacctaaagg 


atcaacagct 


cctggggatt 


tggggttgct 


3360 


ctggaaaact 


catttgcacc 


actgctgtgc 


cttggaatgc 


tagttggagt 


aataaatctc 


3420 


tggaacagat 


ttggaataac 


atgacctgga 


tggagtggga 


cagagaaatt 


aacaattaca 


3480 


caagcttaat 


acactcctta 


attgaagaat 


cgcaaaacca 


gcaagaaaag 


aatgaacaag 


3540 


aattattgga 


attagataaa 


tgggcaagtt 


tgtggaattg 


gtttaacata 


acaaattggc 


3600 


tgtggtatat 


aaaattattc 


ataatgatag 


taggaggctt 


ggtaggttta 


agaatagttt 


3660 


ttgctgtact 


ttctgtagtg 


aatagagtta 


ggcagggata 


ttcaccatta 


tcgtttcaga 


3720 


cccacctccc 


aatcccgagg 


ggacccgaca 


ggcccgaagg 


aatagaagaa 


gaaggtggag 


3780 


agagagacag 


agacagatcc 


attcgattag 


tgaacggatc 


cttagcactt 


atctggtaa 


3839 



<210> 53 

<211> 1101 

<212> PRT 

<213> Artificial sequence 



<220> 

<223> Modified Env/Tat 



<400> 53 



Met Arg Val Lys Glu Lys Tyr Gin His Leu Trp Arg Trp Gly Trp Arg 
15 10 15 



Trp Gly Thr Met Leu Leu Gly Met Leu Met lie Cys Ser Ala Thr Glu 
20 25 30 



Lys Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp Lys Glu Ala 
35 40 45 



Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Asp Thr Glu 
50 55 60 



Val His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn 
65 70 75 80 



Pro Gin Glu Val Val Leu Val Asn Val Thr Glu Asn Phe Asn Met Trp 
85 90 95 



Lys Asn Asp Met Val Glu Gin Met His Glu Asp lie lie Ser Leu Trp 
100 105 110 



Asp Gin Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Gly 
115 120 125 



Ala Gly Ser Cys Asn Thr Ser Val lie Thr Gin Ala Cys Pro Lys Val 

130 135 140 



Ser Phe Glu Pro lie Pro lie His Tyr Cys Ala Pro Ala Gly Phe Ala 
145 150 155 160 



lie Leu Lys Cys Asn Asn Lys Thr Phe Asn Gly Thr Gly Pro Cys Thr 
165 170 175 



Asn Val Ser Thr Val Gin Cys Thr His Gly lie Arg Pro Val Val Ser 
180 185 190 



Thr Gin Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Val lie 
195 200 205 



Arg Ser Ala Asn Phe Thr Asp Asn Ala Lys Thr He He Val Gin Leu 
210 215 220 



Asn Gin Ser Val Glu He Asn Cys Thr Arg Pro Asn Asn Asn Thr Arg 
225 230 235 240 



Lys Ser lie Arg lie Gin Arg Gly Pro Gly Arg Ala Phe Val Thr He 
245 250 255 



Gly Lys He Gly Asn Met Arg Gin Ala His Cys Leu Gly Cys Thr Arg 
260 265 270 



Pro Asn Asn Asn Thr Arg Lys Ser Val Arg He Gly Pro Gly Gin Thr 
275 280 285 



Phe Tyr Ala Thr Gly Asp He He Gly Asp He Arg Gin Ala His Cys 
290 295 300 



Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys Ser He Arg He Gly Pro 
305 310 315 320 



Gly Gin Ala Phe Tyr Ala Thr Gly Glu He He Gly Asp He Arg Gin 
325 330 335 



Ala His Cys Cys Thr Arg Pro Tyr Asn Asn He Arg Gin Arg Thr Pro 
340 345 350 



He Gly Leu Gly Gin Ala Leu Tyr Thr Thr Arg Arg He Glu Asp He 
355 360 365 



Arg Arg Ala His Cys Cys Thr Arg Pro Ser Thr Asn Thr Arg Thr Ser 
370 375 380 



He Arg He Gly Pro Gly Gin Val Phe Tyr Arg Thr Gly Asp He Thr 
385 390 395 400 



Gly Asp He Arg Lys Ala Tyr Cys Gly Ser Cys Thr Arg Pro Asn Asn 
405 410 415 



Asn Thr Arg Lys Arg He Ser Leu Gly Pro Gly Arg Val Phe Tyr Thr 
420 425 430 



Ala Gly Glu He He Gly Asp He Arg Lys Ala His Cys Cys Thr Arg 
435 440 445 



Pro Asn Asn Asn Thr Arg Lys Ser He Thr Phe Ala Pro Gly Gin Ala 
450 455 460 



Leu Tyr Ala Thr Gly Glu He He Gly Asp He Arg Gin Ala His Cys 
465 470 475 480 



Leu Gly Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys Ser Val Arg lie 
485 490 ' 495 



Gly Pro Gly Gin Thr Phe Tyr Ala Thr Gly Asp lie lie Gly Asp lie 
500 * 505 510 



Arg Gin Ala His Cys Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys Ser 
515 520 525 



lie Arg lie Gly Pro Gly Gin Ala Phe Tyr Ala Thr Gly Glu lie lie 
530 535 540 



Gly Asp lie Arg Gin Ala His Cys Cys Thr Arg Pro Tyr Asn Asn lie 
545 550 555 560 



Arg Gin Arg Thr Pro lie Gly Leu Gly Gin Ala Leu Tyr Thr Thr Arg 
565 570 575 



Arg lie Glu Asp lie Arg Arg Ala His Cys Cys Thr Arg Pro Ser Thr 
580 585 590 



Asn Thr Arg Thr Ser lie Arg lie Gly Pro Gly Gin Val Phe Tyr Arg 
595 600 605 



Thr Gly Asp lie Thr Gly Asp lie Arg Lys Ala Tyr Cys Gly Ser Cys 
610 '* 615 620 



Thr Arg Pro Asn Asn Asn Thr Arg Lys Arg lie Ser Leu Gly Pro Gly 
625 "* 630 * 635 640 



Arg Val Phe Tyr Thr Ala Gly Glu lie lie Gly Asp lie Arg Lys Ala 
645 650 655 



His Cys Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys Ser lie Thr Phe 
660 " 665 670 



Ala Pro Gly Gin Ala Leu Tyr Ala Thr Gly Glu lie He Gly Asp He 
675 680 685 



Arg Gin Ala His Cys Leu Gly Asn He Ser Arg Ala Lys Trp Asn Asn 
690 695 700 



Thr Leu Lys Gin He Asp Ser Lys Leu Arg Glu Gin Phe Gly Asn Asn 
705 710 715 720 



Lys Thr lie lie Phe Lys Gin Ser Ser Gly Gly Asp Pro Glu lie Val 
725 730 735 



Thr His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn Ser Thr 
740 ~ ^ 745 " 750 



Gin Leu Phe Asn Ser Thr Trp Phe Asn Ser Thr Trp Ser Thr Lys Gly 
755 760 765 



Ser Asn Asn Thr Glu Gly Ser Asp Thr lie Thr Leu Pro Cys Arg lie 
770 775 780 



Lys Gin lie He Asn Met Trp Gin Glu Val Gly Lys Ala Met Tyr Ala 
785 790 795 800 



Pro Pro He Ser Gly Gin He Arg Cys Ser Ser Asn He Thr Gly Leu 
805 810 815 



Leu Leu Thr Arg Asp Gly Gly Asn Ser Asn Asn Glu Ser Glu He Phe 
820 825 830 



Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr 
835 " 840 845 



Lys Tyr Lys Val Val Lys He Glu Pro Leu Gly Val Ala Pro Thr Lys 
850 855 860 



Ala Lys Arg Arg Val Val Gin Thr Ser Ala Val Gly He Gly Ala Leu 
865 ~ 870 875 880 



Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met Gly Ala Ala Ser 
885 890 895 



Met Thr Leu Thr Val Gin Ala Arg Gin Leu Leu Ser Gly He Val Gin 
900 905 910 



Gin Gin Asn Asn Leu Leu Arg Ala He Glu Ala Gin Gin His Leu Leu 
915 920 925 



Gin Leu Thr Val Trp Gly He Lys Gin Leu Gin Ala Arg He Leu Ala 
930 935 940 



Val Glu Arg Tyr Leu Lys Asp Gin Gin Leu Leu Gly He Trp Gly Cys 
945 ~ 950 955 960 



Ser Gly Lys Leu He Cys Thr Thr Ala Val Pro Trp Asn Ala Ser Trp 



965 



970 



975 



O 



Ser Asn Lys Ser Leu Glu Gin lie Trp Asn Asn Met Thr Trp Met Glu 
980 985 990 



Trp Asp Arg Glu lie Asn Asn Tyr Thr Ser Leu lie His Ser Leu lie 
995 1000 1005 



Glu Glu Ser Gin Asn Gin Gin Glu Lys Asn Glu Gin Glu Leu Leu 
1010 1015 1020 



Glu Leu Asp Lys Trp Ala Ser Leu Trp Asn Trp Phe Asn He Thr 
1025 1030 1035 



Asn Trp Leu Trp Tyr He Lys Leu Phe He Met He Val Gly Gly 
1040 1045 1050 



Leu Val Gly Leu Arg He Val Phe Ala Thr His Leu Pro He Pro 
1055 1060 1065 



Arg Gly Pro Asp Arg Pro Glu Gly He Glu Glu Glu Gly Gly Glu 
1070 1075 1080 



Arg Asp Arg Asp Arg Ser He Arg Leu Val Asn Gly Ser Leu Ala 
1085 1090 1095 



Leu He Trp 
1100 



<210> 54 
<211> 4040 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Modified Env/Tat/Rev 
<400> 54 

gaattctgca acaactgctg tttatccatt ttcagaattg ggtgtcgaca tagcagaata 6 0 

ggcgttactc gacagaggag agcaagaaat ggagccagta gatcctagac tagagccctg 120 
gaagcatcca ggaagtcagc ctaaaactgc ttgtaccaat tgctattgta aaaagtgttg 180 
ctttcattgc caagtttgtt tcataacaaa agccttaggc atctcctatg gcaggaagaa 240 
gcggagacag cgacgaagac ctcctcaagg cagtcagact catcaagttt ctctatcaaa 300 
gcagtaagta gtacatgtaa tgcaacctat acaaatagca atagtagcat tagtagtagc 360 
aataataata gcaatagttg tgtggtccat agtaatcata gaatatagga aaatattaag 420 



acaaagaaaa 
tgagagtgaa 
tccttgggat 
gggtacctgt 
atgatacaga 
cacaagaagt 
tagaacagat 
aattaacccc 
gtccaaaggt 
ttctaaaatg 
tacaatgtac 
tggcagaaga 
tagtacagct 
aaagtatccg 
atatgagaca 
tacgtatagg 
aagcacattg 
gacaagcatt 
caaggcccta 
caacaagaag 
caagaacaag 
gagatataag 
gaatatcttt 
gaaaggcaca 
caggacaagc 
tcgggtgtac 
cattctatgc 
ccaacaataa 
gagaaataat 
gacaaaggac 



atagacaggt 
ggagaaatat 
gttgatgatc 
gtggaaggaa 
ggtacataat 
agtattggta 
gcatgaggat 
actctgtgtt 
atcctttgag 
taataataag 
acatggaatt 
agaggtagta 
gaaccaatct 
tatccagaga 
agcacattgt 
accaggacaa 
ttgtacgaga 
ctatgcaaca 
caacaatata 
aatagaagat 
tatacgtata 
aaaagcatat 
aggaccagga 
ttgttgtacc 
gctctatgca 
cagacctaac 
aacaggtgat 
tacaagaaaa 
aggagatata 
ccccatagga 



taattgatag 
cagcacttgt 
tgtagtgcta 
gcaaccacca 
gtttgggcca 
aatgtgacag 
ataatcagtt 
ggagctggta 
ccaattccca 
acgttcaatg 
aggccagtag 
attagatctg 
gtagaaatta 
ggaccaggga 
ctcgggtgta 
acattctatg 
cccaacaata 
ggagaaataa 
agacaaagga 
ataagaagag 
ggaccaggac 
tgtggatcct 
cgagtatttt 
agacctaata 
acaggtgaaa 
aacaatacaa 
ataatagggg 
agtataagga 
agacaagcac 
ctagggcaag 



actaatagaa 
ggagatgggg 
cagaaaaatt 
ctctattttg 
cacatgcctg 
aaaattttaa 
tatgggatca 
gttgtaacac 
tacattattg 
gaacaggacc 
tatcaactca 
ccaatttcac 
attgtacaag 
gagcatttgt 
ccagacctaa 
caacaggtga 
atacaagaaa 
taggagatat 
cccccatagg 
cacattgttg 
aagtattcta 
gtacaagacc 
atacagcagg 
acaatacaag 
taataggaga 
gaaaaagtgt 
atataagaca 
taggaccagg 
attgttgcac 
cactctatac 



agagcagaag 
gtggagatgg 
gtgggtcaca 
tgcatcagat 
tgtacccaca 
catgtggaaa 
aagcctaaag 
ctcagtcatt 
tgccccggct 
atgtacaaat 
actgctgtta 
agacaatgct 
acccaacaac 
tacaatagga 
caacaataca 
tataataggg 
aagtataagg 
aagacaagca 
actagggcaa 
taccagaccc 
tagaacagga 
caacaacaat 
agaaataata 
aaaaagtata 
tataagacaa 
acgtatagga 
agcacattgt 
acaagcattc 
aaggccctac 
aacaagaaga 



acagtggcaa 
ggcaccatgc 
gtctattatg 
gctaaagcat 
gaccccaacc 
aatgacatgg 
ccatgtgtaa 
acacaggcct 
ggttttgcga 
gtcagcacag 
aatggcagtc 
aaaaccataa 
aatacaagaa 
aaaataggaa 
agaaaaagtg 
gatataagac 
ataggaccag 
cattgttgca 
gcactctata 
tccaccaata 
gacataacag 
acaagaaaaa 
ggagacatca 
acttttgcac 
gcacattgtc 
ccaggacaaa 
tgtacgagac 
tatgcaacag 
aacaatataa 
atagaagata 



taagaagagc 


acattgttgt 


accagaccct 


ccaccaatac 


aagaacaagt 


ataegtatag 


2280 


gaccaggaca 


agtattctat 


agaacaggag 


acataacagg 


agatataaga 


aaagcatatt 


2340 


gtggatcctg 

ZJ ~i ZJ _3 


tacaagaccc 


aacaacaata 


caagaaaaag 


aatatcttta 


ggaccaggac 


2400 


gagtatttta 


tacagcagga 


gaaataatag 


gagacatcag 

ZJ ZJ ZJ 


aaaggcacat 


tgttgtacca 


2460 


gacctaataa 


caatacaaga 


aaaagtataa 


cttttgcacc 


aggacaagcg 


etctatgeaa 


2520 


caggtgaaat 


aataggagat 


ataagacaag 


cacattgtct 


cgggaacatt 


agtagagcaa 


2580 


aatggaataa 


cactttaaaa 


cagatagata 


gcaaattaag 


agaacaattt 


ggaaataata 


2640 


aaacaataat 


ctttaagcag 


tcctcaggag 


gggacccaga 


aattgtaacg 


cacagtttta 


2700 


attqtqqaqq 


ggaatttttc 


tactgtaatt 


caacacaact 


gtttaatagt 


acttggttta 


2760 


atagtacttg 


gagtactaaa 


gggtcaaata 


acactgaagg 


aagtgacaca 


atcaccctcc 


2820 


catgcagaat 


aaaacaaatt 


ataaacatgt 


qqcaqqaaqt 


aggaaaagca 


atgtatgccc 


2880 


ctcccatcag 


tggacaaatt 


agatgttcat 


caaatattac 


aqqqctqcta 


ttaacaagag 


2940 


atqqtaataa 


tagcaacaat 


aaatccaaaa 


tcttcagacc 


t aqacfaacrcfa 


qa t atqaqqq 


3000 


acaattaaao 


aaataaatta 


tataaatata 


aaataataaa 


aattaaacca 


ttaaaaataa 


3060 


cacccaccaa 

V** W V^c w C4. w OA CA. 


aacaaaoaaa 




aaactaatqc 


aatooqaata 


ggagctttgt 


3120 


tccttgggtt 


cttgggagca 


gc agga ag c a 


ctataqqcta 


cacgtcaatg 


acqctqacqq 

ZJ ZD j 


3180 


tacaggccag 


acaattafctg 


tctgatatag 


tgcagcagca 


gaacaatttg 


ctqaqqqcta 

v— ^Zj ZJZJZJ 


3240 


ttgaggcgca 


acagcafcctg 


ttgcaactca 


cagtctgggg 


catcaaacag 


ctccaggcaa 


3300 


aaatcctacjc 

~J t**. wv w ^ w 


tatqqaaaqa 


tacctaaagg 


atcaacagct 


cc toaqqatt 


tqqqqttqct 

^ZJZJZJZ) ^ "-t^^- 


3360 


ctggaaaact 


catttgcacc 


actgctgtgc 


cttggaatgc 


tagttggagt 


aataaatctc 


3420 


taaaacaaat 

Zj 


ttaqaataac 


atgacctgga 


fcqqaqtqqqa 


cagagaaafct 


aacaattaca 


3480 


caaacttaat 


acactcctta 


attgaagaat 


cgcaaaacca 


gcaagaaaag 


aatgaacaag 


3540 


a a 1 1 a 1 1 aaa 


atfaoataaa 


tgggcaagtt 


tgtggaattg 


gtttaacata 


acaaattggc 


3600 


tcrtacrt" a1~ at - 


aaaatfcattc 


ataataataa 


taggaggcfct 


ggtaggt tta 


agaatagtfct 


3660 


1 1~cjpt"crt" act" 


1~'t"pt"a't"acr1~cr 

l— v— lw CL 1— OL^ 


aaf acracrt*t"a 


crcrr* a arocr a t a 


tt" ca era t"t a 


tcatttcaaa 


3720 


rfr , car , r , tr , r , p 

O- ^ V^UUV/l/ 


a at" pppcraacr 


craacpccraca 


aacccaaaaa 


aataaaaaaa 


aaaaatcfaaa 


3780 


actaaaaacaa 


aaacaaatcc 


attccrattaa 


fcaaa caaatc 


ct - taaractt 


atctaaaaca 


3840 


atctgcggag 


cctgtgcctc 


ttcagctacc 


accgcttgag 


agacttactc 


ttgattgtaa 


3900 


cgaggattgt 


ggaacttctg 


ggacgcaggg 


ggtgggaagc 


cctcaaatat 


tggtggaatc 


3960 


tcctacagta 


ttggagtcag 


gaactaaaga 


atagtgctgt 


taacttgetc 


aatgccacag 


4020 


ccatagcagt 


agctgagtaa 










4040 



1,1 



<210> 55 

<211> 1186 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> Modified Env/Tat/Rev 

<400> 55 

Met Arg Val Lys Glu Lys Tyr Gin His Leu Trp Arg Trp Gly Trp Arg 
1 5 10 15 



Trp Gly Thr Met Leu Leu Gly Met Leu Met lie Cys Ser Ala Thr Glu 
20 25 30 



Lys Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp Lys Glu Ala 
35 40 45 



Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Asp Thr Glu 
50 55 60 



Val His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn 
65 70 75 80 



Pro Gin Glu Val Val Leu Val Asn Val Thr Glu Asn Phe Asn Met Trp 
85 90 95 



Lys Asn Asp Met Val Glu Gin Met His Glu Asp lie lie Ser Leu Trp 
100 105 110 



Asp Gin Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Gly 
115 ~ 120 125 



Ala Gly Ser Cys Asn Thr Ser Val lie Thr Gin Ala Cys Pro Lys Val 
130 ' 135 140 



Ser Phe Glu Pro lie Pro lie His Tyr Cys Ala Pro Ala Gly Phe Ala 
145 150 155 160 



lie Leu Lys Cys Asn Asn Lys Thr Phe Asn Gly Thr Gly Pro Cys Thr 
165 170 175 



Asn Val Ser Thr Val Gin Cys Thr His Gly He Arg Pro Val Val Ser 
180 185 190 



Thr Gin Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Val He 



195 



200 



205 



Arg Ser Ala Asn Phe Thr Asp Asn Ala Lys Thr lie lie Val Gin Leu 
210 215 220 



Asn Gin Ser Val Glu lie Asn Cys Thr Arg Pro Asn Asn Asn Thr Arg 
225 230 235 240 



Lys Ser lie Arg lie Gin Arg Gly Pro Gly Arg Ala Phe Val Thr lie 
245 250 255 



Gly Lys lie Gly Asn Met Arg Gin Ala His Cys Leu Gly Cys Thr Arg 
260 265 270 



Pro Asn Asn Asn Thr Arg Lys Ser Val Arg lie Gly Pro Gly Gin Thr 
275 280 285 



Phe Tyr Ala Thr Gly Asp lie lie Gly Asp lie Arg Gin Ala His Cys 
290 295 300 



Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys Ser lie Arg lie Gly Pro 
305 310 315 320 



Gly Gin Ala Phe Tyr Ala Thr Gly Glu He He Gly Asp He Arg Gin 
325 330 335 



Ala His Cys Cys Thr Arg Pro Tyr Asn Asn He Arg Gin Arg Thr Pro 
340 345 350 



He Gly Leu Gly Gin Ala Leu Tyr Thr Thr Arg Arg He Glu Asp He 
355 360 365 



Arg Arg Ala His Cys Cys Thr Arg Pro Ser Thr Asn Thr Arg Thr Ser 
370 375 380 



He Arg He Gly Pro Gly Gin Val Phe Tyr Arg Thr Gly Asp He Thr 
385 " 390 395 400 



Gly Asp He Arg Lys Ala Tyr Cys Gly Ser Cys Thr Arg Pro Asn Asn 
405 410 415 



Asn Thr Arg Lys Arg lie Ser Leu Gly Pro Gly Arg Val Phe Tyr Thr 
420 425 430 



Ala Gly Glu He He Gly Asp He Arg Lys Ala His Cys Cys Thr Arg 
435 440 445 



Pro Asn Asn Asn Thr Arg Lys Ser lie Thr Phe Ala Pro Gly Gin Ala 
450 455 460 



Leu Tyr Ala Thr Gly Glu lie lie Gly Asp lie Arg Gin Ala His Cys 
465 470 475 480 



Leu Gly Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys Ser Val Arg lie 
485 490 495 



Gly Pro Gly Gin Thr Phe Tyr Ala Thr Gly Asp lie lie Gly Asp lie 
500 505 510 



Arg Gin Ala His Cys Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys Ser 
515 * 520 525 



lie Arg lie Gly Pro Gly Gin Ala Phe Tyr Ala Thr Gly Glu lie lie 
530 535 540 



Gly Asp lie Arg Gin Ala His Cys Cys Thr Arg Pro Tyr Asn Asn lie 
545 ~ 550 J 555 560 



Arg Gin Arg Thr Pro lie Gly Leu Gly Gin Ala Leu Tyr Thr Thr Arg 
565 570 575 



Arg lie Glu Asp lie Arg Arg Ala His Cys Cys Thr Arg Pro Ser Thr 
580 585 590 



Asn Thr Arg Thr Ser lie Arg He Gly Pro Gly Gin Val Phe Tyr Arg 
595 600 605 



Thr Gly Asp He Thr Gly Asp He Arg Lys Ala Tyr Cys Gly Ser Cys 
610 615 620 



Thr Arg Pro Asn Asn Asn Thr Arg Lys Arg He Ser Leu Gly Pro Gly 
625 630 635 640 



Arg Val Phe Tyr Thr Ala Gly Glu He He Gly Asp He Arg Lys Ala 
645 650 655 



His Cys Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys Ser He Thr Phe 
660 665 670 



Ala Pro Gly Gin Ala Leu Tyr Ala Thr Gly Glu He He Gly Asp He 
675 680 685 



Arg Gin Ala His Cys Leu Gly Asn lie Ser Arg Ala Lys Trp Asn Asn 
690 695 700 



Thr Leu Lys Gin lie Asp Ser Lys Leu Arg Glu Gin Phe Gly Asn Asn 
705 710 715 720 



Lys Thr lie lie Phe Lys Gin Ser Ser Gly Gly Asp Pro Glu lie Val 
725 " 730 735 



Thr His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn Ser Thr 
740 745 750 



Gin Leu Phe Asn Ser Thr Trp Phe Asn Ser Thr Trp Ser Thr Lys Gly 
755 760 765 



Ser Asn Asn Thr Glu Gly Ser Asp Thr lie Thr Leu Pro Cys Arg lie 
770 ~ 775 780 



Lys Gin He He Asn Met Trp Gin Glu Val Gly Lys Ala Met Tyr Ala 
785 790 795 800 



Pro Pro He Ser Gly Gin He Arg Cys Ser Ser Asn He Thr Gly Leu 
805 810 815 



Leu Leu Thr Arg Asp Gly Gly Asn Ser Asn Asn Glu Ser Glu He Phe 
820 825 830 



Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr 
835 840 845 



Lys Tyr Lys Val Val Lys He Glu Pro Leu Gly Val Ala Pro Thr Lys 
850 855 860 



Ala Lys Arg Arg Val Val Gin Thr Ser Ala Val Gly He Gly Ala Leu 
865 870 875 880 



Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met Gly Cys Thr Ser 
885 890 895 



Met Thr Leu Thr Val Gin Ala Arg Gin Leu Leu Ser Asp He Val Gin 
900 905 910 



Gin Gin Asn Asn Leu Leu Arg Ala He Glu Ala Gin Gin His Leu Leu 
915 920 925 



Gin Leu Thr Val Trp Gly lie Lys Gin Leu Gin Ala Arg lie Leu Ala 
930 935 940 



Val Glu Arg Tyr Leu Lys Asp Gin Gin Leu Leu Gly lie Trp Gly Cys 
945 ~ 950 955 960 



Ser Gly Lys Leu lie Cys Thr Thr Ala Val Pro Trp Asn Ala Ser Trp 
965 970 975 



Ser Asn Lys Ser Leu Glu Gin lie Trp Asn Asn Met Thr Trp Met Glu 
980 985 990 



Trp Asp Arg Glu lie Asn Asn Tyr Thr Ser Leu lie His Ser Leu lie 
995 1000 1005 



Glu Glu Ser Gin Asn Gin Gin Glu Lys Asn Glu Gin Glu Leu Leu 
1010 1015 1020 



Glu Leu Asp Lys Trp Ala Ser Leu Trp Asn Trp Phe Asn lie Thr 
1025 ~ 1030 1035 



Asn Trp Leu Trp Tyr lie Lys Leu Phe lie Met lie Val Gly Gly 
1040 1045 1050 



Leu Val Gly Leu Arg He Val Phe Ala Val Leu Ser He Val Asn 
1055 1060 1065 



Arg Val Arg Gin Gly Tyr Ser Pro Leu Ser Phe Gin Thr His Leu 
1070 1075 1080 



Pro He Pro Arg Gly Pro Asp Arg Pro Glu Gly He Glu Glu Glu 
1085 ~ 1090 1095 



Gly Gly Glu Arg Asp Arg Asp Arg Ser He Arg Leu Val Asn Gly 
1100 1105 1110 



Ser Leu Ala Leu He Trp Asp Asp Leu Arg Ser Leu Cys Leu Phe 
1115 1120 1125 



Ser Tyr His Arg Leu Arg Asp Leu Leu Leu He Val Thr Arg He 
1130 1135 1140 



Val Glu Leu Leu Gly Arg Arg Gly Trp Glu Ala Leu Lys Tyr Trp 
1145 1150 1155 



Trp Asn Leu Leu Gin Tyr Trp Ser Gin Glu Leu Lys Asn Ser Ala 



1160 1165 1170 

Val Asn Leu Leu Asn Ala Thr Ala He Ala Val Ala Glu 
1175 1180 1185 

/ 

<210> 56 
<211> 507 
<212> DNA 

<213> Human immunodeficiency virus type 1 
<400> 56 



atgttcttta 


gggaagatct 


ggccttccta 


caagggaagg 


ccagggaatt 


ttcttcagag 


60 


cagaccagag 


ccaacagccc 


caccatttct 


tcagagcaga 


ccagagccaa 


cagccccacc 


120 


agaagagagc 


ttcaggtctg 


gggtagagac 


aacaactccc 


cctcagaagc 


aggagccgat 


180 


agacaaggaa 


ctgtatcctt 


taacttccct 


cagatcactc 


tttggcaacg 


acccctcgtc 


240 


acaataaaga 


taggggggca 


actaaaggaa 


gctctattag 


atacaggagc 


agatgataca 


300 


gtattagaag 


aaatgagttt 


gccaggaaga 


tggaaaccaa 


aaatgatagg 


gggaattgga 


360 


ggttttatca 


aagtaagaca 


gtatgatcag 


atactcatag 


aaatctgtgg 


acataaagct 


420 


ataggtacag 


tattagtagg 


acctacacct 


gtcaacataa 


ttggaagaaa 


tctgttgact 


480 


cagattggtt 


gcactttaaa 


tttttaa 








507 



<210> 57 

<211> 168 

<212> PRT 

<213> Human immunodeficiency virus type 1 

<400> 57 

Met Phe Phe Arg Glu Asp Leu Ala Phe Leu Gin Gly Lys Ala Arg Glu 
1 5 10 15 



Phe Ser Ser Glu Gin Thr Arg Ala Asn Ser Pro Thr He Ser Ser Glu 
20 25 30 



Gin Thr Arg Ala Asn Ser Pro Thr Arg Arg Glu Leu Gin Val Trp Gly 
35 40 45 



Arg Asp Asn Asn Ser Pro Ser Glu Ala Gly Ala Asp Arg Gin Gly Thr 
50 55 - 60 



Val Ser Phe Asn Phe Pro Gin He Thr Leu Trp Gin Arg Pro Leu Val 
65 70 75 80 



Thr He Lys He Gly Gly Gin Leu Lys Glu Ala Leu Leu Asp Thr Gly 
85 90 95 



Ala Asp Asp Thr Val Leu Glu Glu Met Ser Leu Pro Gly Arg Trp Lys 
100 105 110 



Pro Lys Met lie Gly Gly lie Gly Gly Phe lie Lys Val Arg Gin Tyr 
115 120 125 



Asp Gin lie Leu He Glu He Cys Gly His Lys Ala He Gly Thr Val 
130 135 140 



Leu Val Gly Pro Thr Pro Val Asn He He Gly Arg Asn Leu Leu Thr 
145 " 150 155 160 



Gin He Gly Cys Thr Leu Asn Phe 
165 



<210> 58 

<211> 1800 

<212> DNA 

<213> Artificial sequence 
<220> 





<223> Gag- 


-Pi 












in 


<400> 58 
atgggtgcga 


gagcgtcagt 


attaagcggg 


ggagaattag 


atcgatggga 


aaaaattcgg 


60 




ttaaggccag 


ggggaaagaa 


aaaatataaa 


ttaaaacata 


tagtatgggc 


aagcagggag 


120 


ri 


ctagaacgat 


tcgcagttaa 


tcctggcctg 


ttagaaacat 


cagaaggctg 


tagacaaata 


180 




ctgggacagc 


tacaaccatc 


ccttcagaca 


ggatcagaag 


aacttagatc 


attatataat 


240 




acagtagcaa 


ccctctattg 


tgtgcatcaa 


aggatagaga 


taaaagacac 


caaggaagct 


300 




ttagacaaga 


tagaggaaga 


gcaaaacaaa 


agtaagaaaa 


aagcacagca 


agcagcagct 


360 




gacacaggac 


acagcagtca 


ggtcagccaa 


aattacccta 


tagtgcagaa 


catccagggg 


420 




caaatggtac 


atcaggccat 


atcacctaga 


actttaaatg 


catgggtaaa 


agtagtagaa 


480 




gagaaggctt 


tcagcccaga 


agtaataccc 


atgttttcag 


cattatcaga 


aggagccacc 


540 




ccacaagatt 


taaacaccat 


gctaaacaca 


gtggggggac 


atcaagcagc 


catgcaaatg 


600 




ttaaaagaga 


ccatcaatga 


ggaagctgca 


gaatgggata 


gagtacatcc 


agtgcatgca 


660 




gggcctattg 


caccaggcca 


gatgagagaa 


ccaaggggaa 


gtgacatagc 


aggaactact 


720 




agtacccttc 


aggaacaaat 


aggatggatg 


acaaataatc 


cacctatccc 


agtaggagaa 


780 




atttataaaa 


gatggataat 


cctgggatta 


aataaaatag 


taagaatgta 


tagccctacc 


840 




agcattctgg 


acataagaca 


aggaccaaaa 


gaacctttta 


gagactatgt 


agaccggttc 


900 



tataaaactc 


taagagccga 


gcaagcttca 


caggaggtaa 


aaaattggat 


gacagaaacc 


960 


ttgttggtcc 


aaaatgcgaa 


cccagattgt 


aagactattt 


taaaagcatt 


qgcraccaaccr 

_j ^3 **** w ** ^3 z3 


1020 


gctacactag 


aagaaatgat 


gacagcatgt 


c agggag t ag 


cracrcra c c cqq 


ccataaggca 


1080 


agagttttgg 


ctgaagcaat 


gagccaagta 


acaaatacag 


ctaccataat 


gatgeagaga 


1140 


ggcaatttta 


ggaaccaaag 


aaagatggtt 


aagtgtttca 


attgtggcaa 


agaagggcac 


1200 


acagccagaa 


attgcagggc 


ccctaggaaa 


aagggctgtt 


ggaaatgtgg 


aaaggaagga 


1260 


caccaaatga 


aagattgtac 


tgagagacag 


gctaatttct 


ttagggaaga 


tctggccttc 


1320 


ctacaaggga 


acrcrccaqqqa 


attttcttca 


gagcagacca 


gagecaacag 


ccccaccatt 


1380 


tcttcagagc 


agaccagagc 


caacagcccc 


accagaagag 


agcttcaggt 


c t gggg t aga 


1440 


gacaacaact 


ccccctcaga 


age aggagc c 


gatagacaag 


gaactgtatc 


ctttaacttc 


1500 


cctcagatca 


ctctttggca 


acgacccctc 


gtcacaataa 


aa a t a crcrcrcrcr 

*-*-3 3 3 333 


geaaefcaaag 


1560 


gaagctctat 


tagatacagg 


agcagatgat 


acagtattag 


aagaaatgag 


tttgccagga 


1620 


agatggaaac 


caaaaatgat 


agggggaatt 


ggaggtttta 


tcaaagtaag 


acagtatgat 


1680 


cagatactca 


tagaaatctg 


tggacataaa 


gctataggta 


cagtattagt 


aggacctaca 


1740 


cctgtcaaca 


taattggaag 


aaatctgttg 


actcagattg 


gttgcacttt 


aaatttttaa 


1800 



<210> 59 

<211> 599 

<212> PRT 

<213> Artificial sequence 



U <220> 

M> <223> Gag-PI 



<400> 59 

Met Gly Ala Arg Ala Ser Val Leu Ser Gly Gly Glu Leu Asp Arg Trp 
15 10 15 



Glu Lys lie Arg Leu Arg Pro Gly Gly Lys Lys Lys Tyr Lys Leu Lys 
20 25 30 



His He Val Trp Ala Ser Arg Glu Leu Glu Arg Phe Ala Val Asn Pro 
35 40 45 



Gly Leu Leu Glu Thr Ser Glu Gly Cys Arg Gin He Leu Gly Gin Leu 
50 55 60 



Gin Pro Ser Leu Gin Thr Gly Ser Glu Glu Leu Arg Ser Leu Tyr Asn 
65 70 ^ 75 80 



Thr Val Ala Thr Leu Tyr Cys Val His Gin Arg lie Glu lie Lys Asp 
85 90 95 



Thr Lys Glu Ala Leu Asp Lys lie Glu Glu Glu Gin Asn Lys Ser Lys 
100 105 110 



Lys Lys Ala Gin Gin Ala Ala Ala Asp Thr Gly His Ser Ser Gin Val 
115 120 125 



Ser Gin Asn Tyr Pro He Val Gin Asn He Gin Gly Gin Met Val His 
130 ' 135 140 



Gin Ala He Ser Pro Arg Thr Leu Asn Ala Trp Val Lys Val Val Glu 
145 150 155 160 



Glu Lys Ala Phe Ser Pro Glu Val He Pro Met Phe Ser Ala Leu Ser 
165 170 175 



Glu Gly Ala Thr Pro Gin Asp Leu Asn Thr Met Leu Asn Thr Val Gly 
180 " 185 190 



Gly His Gin Ala Ala Met Gin Met Leu Lys Glu Thr He Asn Glu Glu 

195 200 205 



Ala Ala Glu Trp Asp Arg Val His Pro Val His Ala Gly Pro He Ala 
210 215 220 



Pro Gly Gin Met Arg Glu Pro Arg Gly Ser Asp He Ala Gly Thr Thr 
225 230 235 240 



Ser Thr Leu Gin Glu Gin He Gly Trp Met Thr Asn Asn Pro Pro lie 
245 250 255 



Pro Val Gly Glu He Tyr Lys Arg Trp He He Leu Gly Leu Asn Lys 
260 265 270 



He Val Arg Met Tyr Ser Pro Thr Ser He Leu Asp He Arg Gin Gly 
275 280 285 



Pro Lys Glu Pro Phe Arg Asp Tyr Val Asp Arg Phe Tyr Lys Thr Leu 
290 295 300 



Arg Ala Glu Gin Ala Ser Gin Glu Val Lys Asn Trp Met Thr Glu Thr 
305 310 315 320 



Leu Leu Val Gin Asn Ala Asn Pro Asp Cys Lys Thr He Leu Lys Ala 



325 



330 



335 



Leu Gly Pro Ala Ala Thr Leu Glu Glu Met Met Thr Ala Cys Gin Gly 
340 345 350 



Val Gly Gly Pro Gly His Lys Ala Arg Val Leu Ala Glu Ala Met Ser 
355 360 3 65 



Gin Val Thr Asn Thr Ala Thr He Met Met Gin Arg Gly Asn Phe Arg 
370 375 380 



Asn Gin Arg Lys Met Val Lys Cys Phe Asn Cys Gly Lys Glu Gly His 
385 " * 390 395 400 



Thr Ala Arg Asn Cys Arg Ala Pro Arg Lys Lys Gly Cys Trp Lys Cys 
405 410 415 



Gly Lys Glu Gly His Gin Met Lys Asp Cys Thr Glu Arg Gin Ala Asn 
420 425 430 



Phe Phe Arg Glu Asp Leu Ala Phe Leu Gin Gly Lys Ala Arg Glu Phe 
435 440 445 



Ser Ser Glu Gin Thr Arg Ala Asn Ser Pro Thr He Ser Ser Glu Gin 
450 455 460 



Thr Arg Ala Asn Ser Pro Thr Arg Arg Glu Leu Gin Val Trp Gly Arg 
465 470 475 480 



Asp Asn Asn Ser Pro Ser Glu Ala Gly Ala Asp Arg Gin Gly Thr Val 
485 490 495 



Ser Phe Asn Phe Pro Gin He Thr Leu Trp Gin Arg Pro Leu Val Thr 
500 505 510 



He Lys He Gly Gly Gin Leu Lys Glu Ala Leu Leu Asp Thr Gly Ala 
515 520 525 



Asp Asp Thr Val Leu Glu Glu Met Ser Leu Pro Gly Arg Trp Lys Pro 
530 535 540 



Lys Met He Gly Gly He Gly Gly Phe He Lys Val Arg Gin Tyr Asp 
545 550 555 560 



Gin He Leu He Glu He Cys Gly His Lys Ala He Gly Thr Val Leu 
565 570 575 



Val Gly Pro Thr Pro Val Asn He He Gly Arg Asn Leu Leu Thr Gin 
580 585 590 



He Gly Cys Thr Leu Asn Phe 





595 


<210> 


60 


<211> 


51 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


PCR primer 


<400> 


60 



aaatcaaccg gaattgaatt ccctcgggtg taccagacct aacaacaata c 



<210> 61 

<211> 42 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> PCR primer 

<400> 61 

attgttgggt ctcgtacaac aatgtgcttg tcttatatcc cc 



<210> 62 

<211> 41 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> PCR primer 

<400> 62 

ggggatataa gacaagcaca ttgtacgaga cccaacaata c 



<210> 63 

<211> 39 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> PCR primer 

<400> 63 

gttgtagggc cttgtgcaac aatgtgcttg tcttatatc 



<210> 64 
<211> 39 
<212> DNA 



<213> Artificial sequence 
<220> 

<223> PCR primer 
<400> 64 

gatataagac aagcacattg ttgcacaagg ccctacaac 



<210> 65 

<211> 36 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> PCR primer 

<400> 65 

ggtggagggt ctggtacaac aatgtgctct tcttat 



<210> 66 

<211> 36 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> PCR primer 

<400> 66 

ataagaagag cacattgttg taccagaccc tccacc 



<210> 67 

<211> 47 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> PCR primer 

<400> 67 

gtattgttgt tgggtcttgt acaacaatat gcttttctta tatctcc 



<210> 68 

<211> 47 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> PCR primer 

<400> 68 

ggagatataa gaaaagcata ttgttgtaca agacccaaca acaatac 



<210> 69 

<211> 39 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> PCR primer 
<400> 69 

gttattaggt ctggtacaac aatgtgcctt tctgatgtc 3 9 



<210> 70 

<211> 39 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> PCR primer 

<400> 70 

gacatcagaa aggcacattg ttgtaccaga cctaataac 3 9 



<210> 71 

<211> 54 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> PCR primer 

<400> 71 

aataaactag tctagacccc cgagtctaga acaatgtgct tgtcttatat ctcc 54 

<210> 72 

<211> 7 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> MMLV SD site 

<400> 72 

aggtaag 7 



<210> 73 

<211> 9 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> MMLV SA site 

<400> 73 

ctgctgcag 9 



<210> 74 

<211> 90 

<212> DNA 

<213> Artificial 



sequence 



<220> 

<223> DNA encoding gpl20 signal peptide 

<400> 74 

atgagagtga aggagaaata tcagcacttg tggagatggg ggtggagatg gggcaccatg 
ctccttggga tgttgatgat ctgtagtgct 



<210> 75 
<211> 129 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> DNA encoding gp41 transmembrane domain 
<400> 75 

ttattcataa tgatagtagg aggcttggta ggtttaagaa tagtttttgc tgtactttct 
gtagtgaata gagttaggca gggatattca ccattatcgt ttcagaccca cctcccaatc 



ccgagggga 



